COMPOUNDS THAT INHIBIT INTERACTION BETWEEN SIGNAL-TRANSDUCING PROTEINS
AND THE GLGF (PDZ/DHR) DOMAIN AND USES THEREOF



The invention disclosed herein was made with Government
support under Grant No. R01GM55147-01 from the National
Institutes of Health of the United States Department of
Health and Human Services. Accordingly, the U.S.
Government has certain rights in this invention.

BACKGROUND 

Throughout this application, various publications are
referenced by author and date. Full citations for these
publications may be found listed alphabetically at the
end of the specification immediately preceding the Sequence
Listing and the claims. The disclosures of these
publications in their entireties are hereby incorporated
by reference into this application in order to more fully
describe the state of the art as known to those skilled
therein as of the date of the invention described and
claimed herein.

Fas (APO-1/CD95) and its ligand have been identified as
important signal-mediators of apoptosis (Itoh, et al.
1991). The structural organization of Fas (APO-1/CD95) has
suggested that it is a member of the tumor necrosis
factor receptor superfamily, which also includes the p75
nerve growth factor receptor (NGFR) (Johnson, et al.
1986), the T-cell-activation marker CD27 (Camerini, et
al. 1991), the Hodgkin-lymphoma-associated antigen CD30
(Smith, et al. 1993), the human B cell antigen CD40
(Stamenkovic, et al. 1989), and T cell antigen OX40
(Mallett, et al. 1990). Genetic mutations of both Fas
and its ligand have been associated with
lymphoproliferative and autoimmune disorders in mice
(Watanabe-Fukunaga, et al. 1992; Takahashi, et al. 1994).
Furthermore, alterations of Fas expression level have
been thought to lead to the induction of apoptosis in
T-cells infected with human immunodeficiency virus (HIV)
(Westendorp, et al. 1995).

Several Fas-interacting signal transducing molecules,
such as Fas-associated phosphatase-1 (FAP-1) (Figure 1)
(Sato, et al. 1995), FADD/MORT1/CAP-1/CAP-2 (Chinnaiyan,
et al. 1995; Boldin, et al. 1995; Kischkel, et al. 1995)
and RIP (Stanger, et al. 1995), have been identified
using yeast two-hybrid and biochemical approaches. All
but FAP-1 associate with the functional cell death domain
of Fas, and overexpression of FADD/MORT1 or RIP induces
apoptosis in cells transfected with these proteins. In
contrast, FAP-1 is the only protein that associates with
the negative regulatory domain (C-terminal 15 amino
acids) (Ito, et al. 1993) of Fas and that inhibits
Fas-induced apoptosis.

FAP-1 (PTPN13) has several alternatively-spliced forms
that are identical to PTP-BAS/hPTP1E/PTPL1 (Maekawa, et
al. 1994; Banville, et al. 1994; Saras, et al. 1994) and
contains a membrane-binding region similar to those found
in the cytoskeleton-associated proteins ezrin (Gould, et
al. 1989), radixin (Funayama, et al. 1991), moesin (Lankes,
et al. 1991), neurofibromatosis type II gene product
(NFII) (Rouleau, et al. 1993), and protein 4.1 (Conboy,
et al. 1991), as well as in the PTPases PTPH1 (Yang, et
al. 1991), PTP-MEG (Gu, et al. 1991), and PTPD1 (Vogel,
et al. 1993). FAP-1 intriguingly contains six GLGF
(PDZ/DHR) repeats that are thought to mediate intra- and
inter-molecular interactions among protein domains. The
third GLGF repeat of FAP-1 was first identified as a
domain showing the specific interaction with the
C-terminus of Fas receptor (Sato, et al. 1995). This
suggests that the GLGF domain may play an important role
in targeting proteins to the submembranous cytoskeleton



WO 98/05347 



PCT/US97/12677 






and/or in regulating biochemical activity. GLGF repeats
have been previously found in guanylate kinases, as well
as in the rat post-synaptic density protein (PSD-95) (Cho,
et al. 1992), which is a homolog of the Drosophila tumor
suppressor protein, lethal-(1)-disc-large-1 [dlg-1]
(Woods, et al. 1991; Kitamura, et al. 1994). These
repeats may mediate homo- and hetero-dimerization, which
could potentially influence PTPase activity, binding to
Fas, and/or interactions of FAP-1 with other signal
transduction proteins. Recently, it has also been
reported that the different PDZ domains of proteins
interact with the C-terminus of ion channels and other
proteins (Figure 1) (TABLE 1) (Kornau, et al. 1995; Kim,
et al. 1995; Matsumine, et al. 1996).

TABLE 1. Proteins that interact with PDZ domains.

Protein                      C-terminal   Associated     Reference
                             sequence     protein
Fas (APO-1/CD95)             SLV          FAP-1          2
NMDA receptor NR2 subunit    SDV          PSD95          3
Shaker-type K+ channel       TDV          PSD95 & DLG    4
APC                          TEV          DLG            5



SUMMARY OF THE INVENTION

This invention provides a composition capable of
inhibiting specific binding between a signal-transducing
protein and a cytoplasmic protein containing the amino
acid sequence (G/S/A/E)-L-G-(F/I/L) (Sequence I.D. No.:
1). Further, the cytoplasmic protein may contain the
amino acid sequence (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L)
(Sequence I.D. No.: 2), wherein X represents any amino
acid which is selected from the group comprising the
twenty naturally occurring amino acids and n represents
at least 2, but not more than 4. In a preferred
embodiment, the amino acid sequence is SLGI (Sequence
I.D. No.: 3). Further, the invention provides for a
composition wherein the signal-transducing protein has at
its carboxyl terminus the amino acid sequence
(S/T)-X-(V/I/L) (Sequence I.D. No.: 4), wherein each -
represents a peptide bond, each parenthesis encloses amino
acids which are alternatives to one another, each slash
within such parentheses separates the alternative amino
acids, and the X represents any amino acid which is selected
from the group comprising the twenty naturally occurring
amino acids.

This invention also provides for a method of identifying
a compound capable of inhibiting specific binding between
a signal-transducing protein and a cytoplasmic protein
containing the amino acid sequence (G/S/A/E)-L-G-(F/I/L).
Further, this invention provides for a method of
identifying a compound capable of inhibiting specific
binding between a signal-transducing protein having at
its carboxyl terminus the amino acid sequence
(S/T)-X-(V/L/I) and a cytoplasmic protein.

This invention also provides for a method of inhibiting the
proliferation of cancer cells, specifically, where the
cancer cells are derived from organs comprising the
colon, liver, breast, ovary, testis, lung, stomach,
spleen, kidney, prostate, uterus, skin, thymus, head and
neck, or the cells are derived from either T-cells or
B-cells.

This invention also provides for a method of treating
cancer in a subject which comprises introducing to the
subject's cancerous cells an amount of the composition
effective to result in apoptosis of the cells,
specifically, where the cancer cells are derived from
organs comprising the thymus, colon, liver, breast,
ovary, testis, lung, stomach, spleen, kidney, prostate,
uterus, skin, head and neck, or the cells are derived
from either T-cells or B-cells.

This invention also provides for a method of inhibiting
the proliferation of virally infected cells, specifically
wherein the virally infected cells are infected with the
Hepatitis B virus, Epstein-Barr virus, influenza virus,
Papilloma virus, Adenovirus, Human T-cell lymphotropic
virus type 1, or HIV.

This invention also provides a pharmaceutical composition
comprising compositions capable of inhibiting specific
binding between a signal-transducing protein and a
cytoplasmic protein.

This invention also provides a pharmaceutical composition
comprising compounds identified to be capable of
inhibiting specific binding between a signal-transducing
protein and a cytoplasmic protein.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1. Diagram of Fas-associated phosphatase-1
protein, showing the six GLGF (PDZ/DHR) domain repeats;
comparison of similar membrane binding sites with other
proteins and proteins that contain GLGF (PDZ/DHR)
repeats.



Figures 2A, 2B, 2C and 2D. Mapping of the minimal region
of the C-terminal of Fas required for the binding to
FAP-1. Numbers at right show each independent clone
(Figures 2C and 2D).
2A. Strategy for screening of a random peptide library
    by the yeast two-hybrid system.
2B. Alignment of the C-terminal 15 amino acids of Fas
    between human (Sequence I.D. No.: 5), rat (Sequence
    I.D. No.: 6), and mouse (Sequence I.D. No.: 7).
2C. The results of screening a semi-random peptide
    library. Top row indicates the amino acids which
    were fixed based on the homology between human and
    rat. Dash lines show unchanged amino acids.
2D. The results of screening a random peptide library
    (Sequence I.D. No.: 8, Sequence I.D. No.: 9,
    Sequence I.D. No.: 10, Sequence I.D. No.: 11,
    Sequence I.D. No.: 12, Sequence I.D. No.: 13,
    Sequence I.D. No.: 14, Sequence I.D. No.: 15,
    Sequence I.D. No.: 16, Sequence I.D. No.: 17,
    respectively).



Figures 3A, 3B and 3C. Inhibition assay of Fas/FAP-1
binding in vitro.
3A. Inhibition assay of Fas/FAP-1 binding using the
    C-terminal 15 amino acids of Fas. GST-Fas fusion
    protein (191-355) was used for in vitro binding
    assay (lanes 1, 3-10). GST-Fas fusion protein
    (191-320) (lane 2) and 1 mM human PAMP (N-terminal
    20 amino acids of proadrenomedullin, M.W. 2460.9)
    (lane 3) were used as negative controls. The
    concentrations of the C-terminal 15 amino acids
    added were 1 (lane 4), 3 (lane 5), 10 (lane 6), 30
    (lane 7), 100 (lane 8), 300 (lane 9), and 1000 µM
    (lane 10).
3B. Inhibition assay of Fas/FAP-1 binding using the
    truncated peptides corresponding to the C-terminal
    15 amino acids of Fas. All synthetic peptides were
    acetylated for this inhibition assay (Sequence I.D.
    No.: 4, Sequence I.D. No.: 18, Sequence I.D. No.:
    19, Sequence I.D. No.: 20, Sequence I.D. No.: 21,
    Sequence I.D. No.: 22, Sequence I.D. No.: 23,
    respectively).
3C. Inhibitory effect on Fas/FAP-1 binding of the
    scanned tripeptides.

Figures 4A, 4B, 4C and 4D.
4A. Interaction of the C-terminal 3 amino acids of Fas
    with FAP-1 in yeast.
4B. Interaction of the C-terminal 3 amino acids of Fas
    with FAP-1 in vitro.
4C. Immuno-precipitation of native Fas with GST-FAP-1.
4D. Inhibition of Fas/FAP-1 binding with Ac-SLV or
    Ac-SLY.

Figures 5A, 5B, 5C, 5D, 5E and 5F. Microinjection of
Ac-SLV into the DLD-1 cell line. Triangles identify the
cells that were microinjected with Ac-SLV and that
showed condensed chromatin. On the other hand, only one
cell of the area appeared apoptotic when microinjected
with Ac-SLY.
5A. Representative examples of the cells microinjected
    with Ac-SLV in the presence of 500 ng/ml CH11 are
    shown in phase contrast.
5B. Representative examples of the cells microinjected
    with Ac-SLY in the presence of 500 ng/ml CH11 are
    shown in phase contrast.
5C. Representative examples of the cells microinjected
    with Ac-SLV in the presence of 500 ng/ml CH11 are
    shown stained with FITC.
5D. Representative examples of the cells microinjected
    with Ac-SLY in the presence of 500 ng/ml CH11 are
    shown stained with FITC.
5E. Representative examples of the cells microinjected
    with Ac-SLV in the presence of 500 ng/ml CH11 are
    shown with fluorescent DNA staining with Hoechst
    33342.
5F. Representative examples of the cells microinjected
    with Ac-SLY in the presence of 500 ng/ml CH11 are
    shown with fluorescent DNA staining with Hoechst
    33342.

Figure 6. Quantitation of apoptosis in microinjected
DLD-1 cells.



Figures 7A, 7B, 7C, 7D, 7E, 7F, 7G, and 7H.
7A. Amino acid sequence of human nerve growth factor
    receptor (Sequence I.D. No.: 24).
7B. Amino acid sequence of human CD4 receptor (Sequence
    I.D. No.: 25).
7C. The interaction of Fas-associated phosphatase-1 to
    the C-terminal of nerve growth factor receptor
    (NGFR) (p75).
7D. Amino acid sequence of human colorectal mutant
    cancer protein (Sequence I.D. No.: 26).
7E. Amino acid sequence of protein kinase C, alpha type.
7F. Amino acid sequence of serotonin 2A receptor
    (Sequence I.D. No.: 27).
7G. Amino acid sequence of serotonin 2B receptor
    (Sequence I.D. No.: 28).
7H. Amino acid sequence of adenomatosis polyposis coli
    protein (Sequence I.D. No.: 29).

Figure 8. Representation of the structural
characteristics of p75 NGFR (low-affinity nerve growth
factor receptor).

Figure 9. Comparison of the C-terminal ends of Fas and
p75 NGFR.

Figure 10. In vitro interaction of 35S-labeled FAP-1 with
various receptors expressed as GST fusion proteins. The
indicated GST fusion proteins immobilized on
glutathione-Sepharose beads were incubated with in vitro
translated, 35S-labeled FAP-1 protein. After the beads were
washed, retained FAP-1 protein was analyzed by SDS-PAGE and
autoradiography.

Figures 11A and 11B. In vitro interaction of 35S-labeled
FAP-1 with GST-p75 deletion mutants.
11A. Schematic representation of the GST fusion
     proteins containing the cytoplasmic domains of
     p75 and p75 deletion mutants. Binding of FAP-1
     to the GST fusion proteins with various p75
     deletion mutants is depicted at the right and
     is based on data from (11B).
11B. Interaction of in vitro translated, 35S-labeled
     FAP-1 protein with various GST fusion proteins
     immobilized on glutathione-Sepharose beads.
     After the beads were washed, retained FAP-1
     protein was analyzed by SDS-PAGE and
     autoradiography.

Figure 12. The association between LexA-C-terminal
cytoplasmic region of p75NGFR and VP16-FAP-1. The
indicated yeast strains were constructed by
transformation and the growth of colonies was tested.
+/- indicates the growth of colonies on his- plate.

DETAILED DESCRIPTION OF THE INVENTION



As used herein, amino acid residues are abbreviated as
follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly;
H, His; I, Ile; K, Lys; L, Leu; M, Met; N, Asn; P, Pro;
Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y,
Tyr.



In order to facilitate an understanding of the material
which follows, certain frequently occurring methods
and/or terms are best described in Sambrook, et al.,
1989.



The present invention provides for a composition capable
of inhibiting specific binding between a
signal-transducing protein and a cytoplasmic protein
containing the amino acid sequence (G/S/A/E)-L-G-(F/I/L),
wherein each - represents a peptide bond, each parenthesis
encloses amino acids which are alternatives to one another,
and each slash within such parentheses separates the
alternative amino acids. Further, the cytoplasmic
protein may contain the amino acid sequence
(K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L), wherein X represents any
amino acid which is selected from the group comprising the
twenty naturally occurring amino acids and n represents at
least 2, but not more than 4. Specifically, in a preferred
embodiment, the cytoplasmic protein contains the amino
acid sequence SLGI.

The amino acid sequence (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L)
is also well-known in the art as "GLGF (PDZ/DHR) amino
acid domain." As used herein, "GLGF (PDZ/DHR) amino acid
domain" means the amino acid sequence
(K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L).
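The motif notation defined above maps naturally onto a regular expression. The following is a minimal sketch, not part of the disclosed methods, of how one might scan a one-letter protein sequence for the GLGF (PDZ/DHR) amino acid domain as defined here. The names GLGF_DOMAIN, GLGF_CORE and find_glgf_domains, and the toy sequences in the usage note, are assumptions introduced for illustration only.

```python
import re

# The twenty naturally occurring amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# (G/S/A/E)-L-G-(F/I/L): the four-residue core sequence.
GLGF_CORE = re.compile(r"[GSAE]LG[FIL]")

# (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L), where n is at least 2 and at most 4.
GLGF_DOMAIN = re.compile(rf"[KRQ][{AMINO_ACIDS}]{{2,4}}[GSAE]LG[FIL]")


def find_glgf_domains(sequence: str) -> list[tuple[int, str]]:
    """Return (start index, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in GLGF_DOMAIN.finditer(sequence.upper())]
```

As a usage illustration with a made-up sequence, find_glgf_domains("MMKAASLGIMM") reports the single match "KAASLGI" starting at index 2, whereas "KASLGI" yields no match because only one residue separates K from the SLGI core.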



In a preferred embodiment, the signal-transducing protein
has at its carboxyl terminus the amino acid sequence
(S/T)-X-(V/I/L), wherein each - represents a peptide
bond, each parenthesis encloses amino acids which are
alternatives to one another, each slash within such
parentheses separates the alternative amino acids, and
the X represents any amino acid which is selected from
the group comprising the twenty naturally occurring amino
acids.
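As an illustration of the carboxyl-terminal motif, the sketch below checks whether a sequence ends in (S/T)-X-(V/I/L). It is applied to the C-terminal tripeptides listed in TABLE 1 and to the tripeptide SLY that appears in the figure descriptions. The function and variable names are hypothetical and are not part of the disclosure.

```python
import re

# (S/T)-X-(V/I/L) anchored at the carboxyl terminus (end of the string);
# X is any of the twenty naturally occurring amino acids.
C_TERMINAL_MOTIF = re.compile(r"[ST][ACDEFGHIKLMNPQRSTVWY][VIL]\Z")


def has_c_terminal_motif(sequence: str) -> bool:
    """True if the last three residues fit (S/T)-X-(V/I/L)."""
    return C_TERMINAL_MOTIF.search(sequence.upper()) is not None


# C-terminal sequences as listed in TABLE 1.
table_1 = {
    "Fas (APO-1/CD95)": "SLV",
    "NMDA receptor NR2 subunit": "SDV",
    "Shaker-type K+ channel": "TDV",
    "APC": "TEV",
}
results = {name: has_c_terminal_motif(tail) for name, tail in table_1.items()}
```

All four entries of TABLE 1 fit the motif, while SLY does not, because tyrosine is not among the residues allowed at the final position.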

The compositions of the subject invention may be, but are
not limited to, antibodies, inorganic compounds, organic
compounds, peptides, peptidomimetic compounds,
polypeptides or proteins, fragments or derivatives which
share some or all properties, e.g. fusion proteins. The
composition may be naturally occurring and obtained by
purification, or may be non-naturally occurring and
obtained by synthesis.



Specifically, the composition may be a peptide containing
the sequence (S/T)-X-(V/I/L)-COOH, wherein each -
represents a peptide bond, each parenthesis encloses
amino acids which are alternatives to one another, each
slash within such parentheses separates the alternative
amino acids, and the X represents any amino acid which is
selected from the group comprising the twenty naturally
occurring amino acids. In preferred embodiments, the
peptide contains one of the following sequences:
DSENSNFRNEIQSLV, RNEIQSLV, NEIQSLV, EIQSLV, IQSLV, QSLV,
SLV, IPPDSEDGNEEQSLV, DSEMYNFRSQLASW, IDLASEFLFLSNSFL,
PPTCSQANSGRISTL, SDSNMNMNELSEV, QNFRTYIVSFV, RETIESTV,
RGFISSLV, TIQSVI, ESLV. A further preferred embodiment
would be an organic compound which has the sequence
Ac-SLV-COOH, wherein the Ac represents an acetyl and each -
represents a peptide bond.
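Several of the preferred sequences above (RNEIQSLV through SLV) form a series in which residues are removed one at a time from the amino terminus while the carboxyl terminus is kept. The sketch below shows how such a series could be enumerated. The function name and the default minimum length of three residues are assumptions made for illustration.

```python
def c_terminal_truncations(peptide: str, min_length: int = 3) -> list[str]:
    """List the peptide and its successive N-terminal truncations.

    Each entry keeps the carboxyl terminus intact; the shortest entry
    has min_length residues.
    """
    return [peptide[i:] for i in range(len(peptide) - min_length + 1)]


series = c_terminal_truncations("RNEIQSLV")
```

For RNEIQSLV this yields RNEIQSLV, NEIQSLV, EIQSLV, IQSLV, QSLV and SLV, in that order, matching the truncated sequences named in the preferred embodiments.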



An example of the subject invention is provided infra.
Acetylated peptides may be automatically synthesized on
an Advanced ChemTech ACT357 using previously published
procedures by analogy. Wang resin was used for each run
and Nα-Fmoc protection was used for all amino acids.
Deprotection was carried out with 20% piperidine/DMF, and
coupling was completed using DIC/HOBt and subsequently
HBTU/DIEA. After the last amino acid was coupled, the
growing peptide on the resin was acetylated with Ac2O/DMF.
The acetylated peptide was purified by HPLC and
characterized by FAB-MS and 1H-NMR.

Further, one skilled in the art would know how to
construct derivatives of the above-described synthetic
peptides coupled to non-acetyl groups, such as amines.

This invention also provides for a composition capable of
inhibiting specific binding between a signal-transducing
protein having at its carboxyl terminus the amino acid
sequence (S/T)-X-(V/I/L), wherein each - represents a
peptide bond, each parenthesis encloses amino acids which
are alternatives to one another, each slash within such
parentheses separates the alternative amino acids, and the
X represents any amino acid which is selected from the
group comprising the twenty naturally occurring amino
acids, and a cytoplasmic protein.

The compositions of the subject invention include
antibodies, inorganic compounds, organic compounds,
peptides, peptidomimetic compounds, polypeptides or
proteins, fragments or derivatives which share some or
all properties, e.g. fusion proteins.

This invention also provides a method of identifying a
compound capable of inhibiting specific binding between
a signal-transducing protein and a cytoplasmic protein
containing the amino acid sequence (G/S/A/E)-L-G-(F/I/L),
wherein each - represents a peptide bond, each
parenthesis encloses amino acids which are alternatives
to one another, and each slash within such parentheses
separates the alternative amino acids, which comprises
(a) contacting the cytoplasmic protein bound to the
signal-transducing protein with a plurality of compounds
under conditions permitting binding between a known
compound previously shown to be able to displace the
signal-transducing protein bound to the cytoplasmic
protein and the bound cytoplasmic protein to form a
complex; and (b) detecting the displaced
signal-transducing protein or the complex formed in step
(a), wherein the displacement indicates that the compound is
capable of inhibiting specific binding between the
signal-transducing protein and the cytoplasmic protein.

The inhibition of the specific binding between the
signal-transducing protein and the cytoplasmic protein
may affect the transcription activity of a reporter gene.

Further, in step (b), the displaced cytoplasmic protein
or the complex is detected by comparing the transcription
activity of a reporter gene before and after the
contacting with the compound in step (a), where a change
of the activity indicates that the specific binding
between the signal-transducing protein and the
cytoplasmic protein is inhibited and the
signal-transducing protein is displaced.
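The before-and-after comparison of reporter activity can be illustrated with a short calculation. This is a hedged sketch only: the specification does not state how large a change must be, so the two-fold threshold, the function name and the example readings are all hypothetical.

```python
def binding_inhibited(activity_before: float,
                      activity_after: float,
                      min_fold_change: float = 2.0) -> bool:
    """Flag a change in reporter activity after contact with a compound.

    min_fold_change is a hypothetical cutoff, not a value from the
    specification. A change in either direction counts, because the
    text only requires that the activity be altered.
    """
    if activity_before <= 0 or activity_after <= 0:
        raise ValueError("reporter activities must be positive")
    ratio = activity_after / activity_before
    return ratio >= min_fold_change or ratio <= 1.0 / min_fold_change
```

With these assumptions, a drop in reporter signal from 100 to 20 arbitrary units would be flagged as a change, while a drop from 100 to 90 would not.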

As used herein, the "transcription activity of a reporter
gene" means that the expression level of the reporter
gene will be altered from the level observed when the
signal-transducing protein and the cytoplasmic protein
are bound. One can also identify the compound by
detecting other biological functions dependent on the
binding between the signal-transducing protein and the
cytoplasmic protein. Examples of reporter genes are
numerous and well-known in the art, including, but not
limited to, histidine resistant genes, ampicillin
resistant genes, and the β-galactosidase gene.

Further, the cytoplasmic protein may be bound to a solid
support. Also, the compound may be bound to a solid
support and comprises an antibody, an inorganic compound,
an organic compound, a peptide, a peptidomimetic
compound, a polypeptide or a protein.



An example of the method is provided infra. One can
identify a compound capable of inhibiting specific
binding between the signal-transducing protein and the
cytoplasmic protein using direct methods of detection
such as immuno-precipitation of the cytoplasmic protein
and the compound bound to a detectable marker. Further,
one could use indirect methods of detection that would
detect the increase or decrease in levels of gene
expression. As discussed infra, one could construct
synthetic peptides fused to a LexA DNA binding domain.
These constructs would be transformed into the L40 strain
with an appropriate cell line having an appropriate
reporter gene. One could then detect whether inhibition
had occurred by detecting the levels of expression of the
reporter gene. In order to detect the expression levels
of the reporter gene, one skilled in the art could employ
a variety of well-known methods, e.g. two-hybrid systems
in yeast, mammalian or other cells.

Further, the contacting of step (a) may be in vitro or in
vivo, and specifically in an appropriate cell, e.g. a yeast
cell or mammalian cell. Examples of mammalian cells
include, but are not limited to, the mouse fibroblast cell
NIH 3T3, CHO cells, HeLa cells, Ltk- cells, Cos cells,
etc.



Other suitable cells include, but are not limited to,
prokaryotic or eukaryotic cells, e.g. bacterial cells
(including gram positive cells), fungal cells, insect
cells, and other animal cells.

Further, the signal-transducing protein may be a cell
surface receptor, signal transducer protein, or a tumor
suppressor protein. Specifically, the cell surface
protein is the Fas receptor and may be expressed in cells
derived from organs including, but not limited to,
thymus, liver, kidney, colon, ovary, breast, testis,
spleen, lung, stomach, prostate, uterus, skin, head, and
neck, or expressed in cells comprising T-cells and
B-cells. In a preferred embodiment, the T-cells are Jurkat
T-cells.

Further, the cell-surface receptor may be a CD4 receptor,
p75 receptor, serotonin 2A receptor, or serotonin 2B
receptor.

Further, the signal transducer protein may be Protein
Kinase C-α type.

Further, the tumor suppressor protein may be an
adenomatosis polyposis coli tumor suppressor protein or
colorectal mutant cancer protein.

Further, the cytoplasmic protein contains the amino acid
sequence SLGI, specifically Fas-associated phosphatase-1.

This invention also provides a method of identifying a
compound capable of inhibiting specific binding between
a signal-transducing protein having at its carboxyl
terminus the amino acid sequence (S/T)-X-(V/I/L), wherein
each - represents a peptide bond, each parenthesis
encloses amino acids which are alternatives to one another,
each slash within such parentheses separates the
alternative amino acids, and the X represents any amino acid
which is selected from the group comprising the twenty
naturally occurring amino acids, and a cytoplasmic
protein, which comprises (a) contacting the
signal-transducing protein bound to the cytoplasmic protein
with a plurality of compounds under conditions permitting
binding between a known compound previously shown to be
able to displace the cytoplasmic protein bound to the
signal-transducing protein and bound signal-transducing
protein to form a complex; and (b) detecting the
displaced cytoplasmic protein or the complex of step (a),
wherein the displacement indicates that the compound is
capable of inhibiting specific binding between the
signal-transducing protein and the cytoplasmic protein.

The inhibition of the specific binding between the
signal-transducing protein and the cytoplasmic protein
affects the transcription activity of a reporter gene.
Further, in step (b), the displaced signal-transducing
protein or the complex is detected by comparing the
transcription activity of a reporter gene before and
after the contacting with the compound in step (a), where
a change of the activity indicates that the specific
binding between the signal-transducing protein and the
cytoplasmic protein is inhibited and the cytoplasmic
protein is displaced.

Further, in step (b), the displaced cytoplasmic protein
or the complex is detected by comparing the transcription
activity of a reporter gene before and after the
contacting with the compound in step (a), where a change
of the activity indicates that the specific binding
between the signal-transducing protein and the
cytoplasmic protein is inhibited and the
signal-transducing protein is displaced.

As used herein, the "transcription activity of a reporter
gene" means that the expression level of the reporter
gene will be altered from the level observed when the
signal-transducing protein and the cytoplasmic protein
are bound. One can also identify the compound by
detecting other biological functions dependent on the
binding between the signal-transducing protein and the
cytoplasmic protein. Examples of reporter genes are
numerous and well-known in the art, including, but not
limited to, histidine resistant genes, ampicillin
resistant genes, and the β-galactosidase gene.

Further, the cytoplasmic protein may be bound to a solid
support, or the compound may be bound to a solid support,
and the compound comprises an antibody, an inorganic
compound, an organic compound, a peptide, a peptidomimetic
compound, a polypeptide or a protein.



An example of the method is provided infra. One could
identify a compound capable of inhibiting specific
binding between the signal-transducing protein and the
cytoplasmic protein using direct methods of detection
such as immuno-precipitation of the cytoplasmic protein
and the compound bound with a detectable marker.
Further, one could use indirect methods of detection that
would detect the increase or decrease in levels of gene
expression. As discussed infra, one could construct
synthetic peptides fused to a LexA DNA binding domain.
These constructs would be transformed into the L40 strain
with an appropriate cell line having a reporter gene.
One could then detect whether inhibition had occurred by
detecting the levels of the reporter gene. Different
methods are also well known in the art, such as employing
a yeast two-hybrid system to detect the expression of a
reporter gene.

Further, the contacting of step (a) can be in vitro or in
vivo, specifically in a yeast cell or a mammalian cell.
Examples of mammalian cells include, but are not limited to,
the mouse fibroblast cell NIH 3T3, CHO cells, HeLa cells,
Ltk- cells, Cos cells, etc.



Other suitable cells include, but are not limited to,
prokaryotic or eukaryotic cells, e.g. bacterial cells
(including gram positive cells), fungal cells, insect
cells, and other animal cells.



Further, the signal-transducing protein is a cell surface
receptor, signal transducer protein, or a tumor
suppressor protein. Specifically, the cell surface
protein is the Fas receptor and is expressed in cells
derived from organs comprising thymus, liver, kidney,
colon, ovary, breast, testis, spleen, stomach, prostate,
uterus, skin, head and neck, or expressed in cells
comprising T-cells and B-cells. In a preferred
embodiment, the T-cells are Jurkat T-cells.



Further, the cell-surface receptor may be a CD4 receptor,
p75 receptor, serotonin 2A receptor, or serotonin 2B
receptor.

Further, the signal transducer protein may be Protein
Kinase C-α type.

Further, the tumor suppressor protein may be an
adenomatosis polyposis coli tumor suppressor protein or
colorectal mutant cancer protein.

Further, the cytoplasmic protein contains the amino acid
sequence SLGI, specifically Fas-associated phosphatase-1.

This invention also provides a method of inhibiting the
proliferation of cancer cells comprising the
above-described composition, specifically, wherein the cancer
cells are derived from organs including, but not limited
to, thymus, liver, kidney, colon, ovary, breast, testis,
spleen, stomach, prostate, uterus, skin, head and neck,
or wherein the cancer cells are derived from cells
comprising T-cells and B-cells.

This invention also provides a method of inhibiting the
proliferation of cancer cells comprising the compound
identified by the above-described method, wherein the
cancer cells are derived from organs including, but not
limited to, thymus, liver, kidney, colon, ovary, breast,
testis, spleen, stomach, prostate, uterus, skin, head and
neck, or wherein the cancer cells are derived from cells
comprising T-cells and B-cells.

The invention also provides a method of treating cancer
in a subject which comprises introducing to the subject's
cancerous cells an amount of the above-described
composition effective to result in apoptosis of the
cells, wherein the cancer cells are derived from organs
including, but not limited to, thymus, liver, kidney,
colon, ovary, breast, testis, spleen, stomach, prostate,
uterus, skin, head and neck, or wherein the cancer cells
are derived from cells comprising T-cells and B-cells.

As used herein, "apoptosis" means programmed cell death of
the cell. The mechanisms and effects of programmed cell
death differ from cell lysis. Some observable effects
of apoptosis are DNA fragmentation and disintegration
into small membrane-bound fragments called apoptotic
bodies.

Means of detecting whether the composition has been 
effective to result in apoptosis of the cells are well- 
known in the art. One means is by assessing the 
morphological change of chromatin using either phase 
contrast or fluorescence microscopy. 

The invention also provides for a method of inhibiting
the proliferation of virally infected cells comprising
the above-described composition or the compound
identified by the above-described method, wherein the
virally infected cells comprise Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma virus,
Adenovirus, Human T-cell lymphotropic virus, type 1 or HIV.

The invention also provides a method of treating a
virally-infected subject which comprises introducing to
the subject's virally-infected cells the above-described
composition effective to result in apoptosis of the cells
or the compound identified by the above-described method
of claim 27 effective to result in apoptosis of the
cells, wherein the virally infected cells comprise the
Hepatitis B virus, Epstein-Barr virus, influenza virus,
Papilloma virus, Adenovirus, Human T-cell lymphotropic
virus, type 1 or HIV.



Means of detecting whether the composition has been
effective to result in apoptosis of the cells are well-known
in the art. One means is by assessing the
morphological change of chromatin using either phase
contrast or fluorescence microscopy.

This invention also provides for a pharmaceutical
composition comprising the above-described composition
in an effective amount and a pharmaceutically acceptable
carrier.

This invention also provides for a pharmaceutical
composition comprising the compound identified by the
above-described method in an effective amount and a
pharmaceutically acceptable carrier.

This invention further provides a composition capable of
specifically binding a signal-transducing protein having
at its carboxyl terminus the amino acid sequence
(S/T)-X-(V/L/I), wherein each - represents a peptide bond,
each parenthesis encloses amino acids which are
alternatives to one another, each slash within such
parentheses separates the alternative amino acids, and the
X represents any amino acid which is selected from the
group comprising the twenty naturally occurring amino
acids. The composition may contain the amino acid
sequence (G/S/A/E)-L-G-(F/I/L), wherein each - represents
a peptide bond, each parenthesis encloses amino acids
which are alternatives to one another, and each slash
within such parentheses separates the alternative amino
acids. In a preferred embodiment, the composition
contains the amino acid sequence
(K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L), wherein X represents any
amino acid which is selected from the group comprising the
twenty naturally occurring amino acids and n represents at
least 2, but not more than 4. In another preferred
embodiment, the composition contains the amino acid
sequence SLGI.
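The motif notation defined above can be read as a set of simple pattern
rules. The following is a minimal sketch, assuming one-letter amino acid
codes and Python regular expressions; the function and constant names are
hypothetical and are not part of the specification. It only checks whether
a sequence fits the stated notation and says nothing about actual binding.

```python
import re

# Any one of the twenty naturally occurring amino acids (one-letter codes).
ANY_AA = "[ACDEFGHIKLMNPQRSTVWY]"

# (S/T)-X-(V/L/I) at the carboxyl terminus: the pattern must end the sequence.
C_TERMINAL_MOTIF = re.compile(rf"[ST]{ANY_AA}[VLI]$")

# (G/S/A/E)-L-G-(F/I/L) anywhere in the sequence.
GLGF_LIKE_MOTIF = re.compile(r"[GSAE]LG[FIL]")

# (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L) with n at least 2 and not more than 4.
EXTENDED_MOTIF = re.compile(rf"[KRQ]{ANY_AA}{{2,4}}[GSAE]LG[FIL]")


def has_c_terminal_motif(sequence: str) -> bool:
    """Return True if the sequence ends in (S/T)-X-(V/L/I)."""
    return C_TERMINAL_MOTIF.search(sequence.upper()) is not None


def has_glgf_like_motif(sequence: str) -> bool:
    """Return True if the sequence contains (G/S/A/E)-L-G-(F/I/L)."""
    return GLGF_LIKE_MOTIF.search(sequence.upper()) is not None


def has_extended_motif(sequence: str) -> bool:
    """Return True if the sequence contains (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L)."""
    return EXTENDED_MOTIF.search(sequence.upper()) is not None
```

For example, `has_c_terminal_motif("DSENSNFRNEIQSLV")` applies the
carboxyl-terminal rule to the 15 C-terminal residues of Fas given in the
Sequence Listing, and `has_glgf_like_motif("SLGI")` applies the binding
rule to the preferred sequence SLGI.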

This invention further provides a method for identifying
compounds capable of binding to a signal-transducing
protein having at its carboxyl terminus the amino acid
sequence (S/T)-X-(V/L/I), wherein each - represents a
peptide bond, each parenthesis encloses amino acids which
are alternatives to one another, each slash within such
parentheses separates the alternative amino acids, and the
X represents any amino acid which is selected from the
group comprising the twenty naturally occurring amino
acids, which comprises (a) contacting the
signal-transducing protein with a plurality of compounds
under conditions permitting binding between a known
compound previously shown to be able to bind to the
signal-transducing protein to form a complex; and (b)
detecting the complex formed in step (a) so as to identify
a compound capable of binding to the signal-transducing
protein. Specifically, the identified compound contains
the amino acid sequence (G/S/A/E)-L-G-(F/I/L). In a
further preferred embodiment, the identified compound
contains the amino acid sequence SLGI.



Further, in the above-described method, the
signal-transducing protein may be bound to a solid support.
Also, the compound may be bound to a solid support, and
may comprise an antibody, an inorganic compound, an
organic compound, a peptide, a peptidomimetic compound,
a polypeptide or a protein.

Further, the signal-transducing protein may be a
cell-surface receptor or a signal transducer. Specifically,
the signal-transducing protein may be the Fas receptor,
CD4 receptor, p75 receptor, serotonin 2A receptor,
serotonin 2B receptor, or protein kinase-C-α-type.

This invention also provides a method of restoring 
negative regulation of apoptosis in a cell comprising the 
above-described composition or a compound identified by 
the above-described method. 

As used herein, "restoring negative regulation of
apoptosis" means preventing the cell from proceeding onto
programmed cell death.

For example, cells that have functional Fas receptors and
Fas-associated phosphatase 1 do not proceed onto
programmed cell death or apoptosis due to the negative
regulation of Fas by the phosphatase. However, if
Fas-associated phosphatase 1 is unable to bind to the
carboxyl terminus of the Fas receptor ((S/T)-X-(V/L/I)
region), e.g. because of mutation or deletion of at least
one of the amino acids in the amino acid sequence
(G/S/A/E)-L-G-(F/I/L), the cell will proceed to apoptosis.
By introducing a compound capable of binding to the carboxyl
terminus of the Fas receptor, one could mimic the effects
of a functional phosphatase and thus restore the negative
regulation of apoptosis.



This invention also provides a method of preventing
apoptosis in a cell comprising the above-described
composition or a compound identified by the
above-described method.

This invention also provides a means of treating
pathogenic conditions caused by apoptosis of relevant
cells comprising the above-described composition or the
compound identified by the above-described method.

This invention is illustrated in the Experimental Details
section which follows. These sections are set forth to
aid in an understanding of the invention but are not
intended to, and should not be construed to, limit in any
way the invention as set forth in the claims which follow
thereafter.

Experimental Details

Methods and Materials

1. Screening a semi-random and random peptide library.

To create numerous mutations in a restricted DNA
sequence, PCR mutagenesis with degenerate
oligonucleotides was employed according to a protocol
described elsewhere (Hill, et al. 1987). Based on the
homology between human and rat Fas, two palindromic
sequences were designed for construction of the
semi-random library. The two primers used were

5 ' - CGGAATT^NNNNNNNNNAAC^ 

NTGAGGATCCTCA-3 ' (Seq. I.D. No.: 30) and 

5 ' - CG GAATTC GACTCAGAANlSnsnsn^ 

CTGA GGATCC TCA- 3 ' (Seq. I.D. No.: 31). Briefly, the two 

primers (each 200 pmol), purified by HPLC, were annealed
at 70 °C for 5 minutes and cooled at 23 °C for 60 minutes.
A Klenow fragment (5 U) was used for filling in with a
dNTP mix (final concentration, 1 mM per each dNTP) at
23 °C for 60 minutes. The reaction was stopped with 1 µl
of 0.5 M EDTA and the DNA was purified with ethanol
precipitation. The resulting double-stranded DNA was
digested with EcoRI and BamHI and re-purified by
electrophoresis on non-denaturing polyacrylamide gels.
The double-stranded oligonucleotides were then ligated into
the EcoRI-BamHI sites of the pBTM116 plasmid. The
ligation mixtures were electroporated into E. coli
XL1-Blue MRF' (Stratagene) for the plasmid library. The
large scale transformation was carried out as previously
reported. The plasmid library was transformed into
L40-strain cells (MATa, trp1, leu2, his3, ade2,
LYS2::(lexAop)4-HIS3, URA3::(lexAop)8-lacZ) carrying the
plasmid pVP16-31 containing a FAP-1 cDNA (Sato, et al.
1995). Clones that formed on histidine-deficient medium
(His+) were transferred to plates containing 40 µg/ml
X-gal to test for a blue reaction product (β-gal+) in
plate and filter assays. The clones selected by the His+
and β-gal+ assays were subjected to further analysis. The
palindromic oligonucleotide,
5'-CGGAATTC-(NNN)4-15-TGAGGATCCTCA-3' (Seq. I.D. No. 32),
was used for the construction of the random peptide
library.
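A fully degenerate (NNN) codon can encode a stop codon as well as an amino
acid, which limits how many random inserts are read through to their end.
The following is a minimal sketch of that arithmetic, assuming the standard
genetic code, an equimolar mix of the four bases at every N position, and
in-frame reading from the first codon; none of these assumptions is stated
in the protocol above, and the function names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame, stopping at the first stop codon."""
    peptide = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3].upper()]
        if amino_acid == "*":
            break
        peptide.append(amino_acid)
    return "".join(peptide)


def fraction_without_stop(n_codons: int) -> float:
    """Fraction of random (NNN)n inserts containing no stop codon."""
    sense_codons = sum(1 for aa in CODON_TABLE.values() if aa != "*")
    return (sense_codons / len(CODON_TABLE)) ** n_codons
```

Under these assumptions, `translate("AGTCTTGTCTGA")` reads the three codons
for Ser-Leu-Val followed by a TGA stop, and `fraction_without_stop(n)`
gives the expected share of n-codon inserts that are translated in full.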

2. Synthesis of peptides 

Peptides were automatically synthesized on an Advanced
ChemTech ACT357 by analogy to published procedures
(Schnorrenberg and Gerhardt, 1989). Wang resin (0.2-0.3
mmole scale) was used for each run and Nα-Fmoc protection
was employed for all amino acids. Deprotection was
achieved by treatment with 20% piperidine/DMF and
coupling was completed using DIC/HOBt and subsequent
HBTU/DIEA. After the last amino acid was coupled, the
growing peptide on the resin was acetylated with Ac2O/DMF.
The peptide was cleaved from the resin with concomitant
removal of all protecting groups by treating with TFA.
The acetylated peptide was purified by HPLC and
characterized by FAB-MS and 1H-NMR.

3. Inhibition assay of Fas/FAP-1 binding using the
C-terminal 15 amino acids of Fas.

HFAP-10 cDNA (Sato, et al. 1995) subcloned into the
Bluescript vector pSK-II (Stratagene) was in
vitro-translated from an internal methionine codon in the
presence of 35S-L-methionine using a coupled in vitro
transcription/translation system (Promega, TNT lysate)
and T7 RNA polymerase. The resulting 35S-labeled protein
was incubated with GST-Fas fusion proteins that had been
immobilized on GST-Sepharose 4B affinity beads
(Pharmacia) in a buffer containing 150 mM NaCl, 50 mM
Tris [pH 8.0], 5 mM DTT, 2 mM EDTA, 0.1% NP-40, 1 mM
PMSF, 50 µg/ml leupeptin, 1 mM benzamidine, and 7 µg/ml
pepstatin for 16 hours at 4 °C. After washing vigorously
4 times in the same buffer, associated proteins were
recovered with the glutathione-Sepharose beads by
centrifugation, eluted into boiling Laemmli buffer, and
analyzed by SDS-PAGE and fluorography.



4. Inhibition assay of terminal 15 amino acids of Fas
and inhibitory effect of Fas/FAP-1 binding using
diverse tripeptides.

In vitro-translated [35S]HFAP-1 was purified with a NAP-5
column (Pharmacia) and incubated with 3 µM of GST-fusion
proteins for 16 hours at 4 °C. After washing 4 times in
the binding buffer, radioactivity incorporation was
determined in a β counter. The percentage of binding
inhibition was calculated as follows: percent inhibition
= [radioactivity incorporation using GST-Fas (191-335)
with peptides - radioactivity incorporation using GST-Fas
(191-320) with peptides] / [radioactivity incorporation
using GST-Fas (191-335) without peptides - radioactivity
incorporation using GST-Fas (191-320) without peptides].
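The calculation above can be written out directly. The following is a
minimal sketch that implements the ratio exactly as worded, with GST-Fas
(191-320) counts treated as background and subtracted from GST-Fas
(191-335) counts in both numerator and denominator. Scaling the ratio by
100 to express it as a percentage is an assumption, as are the function
and parameter names. As worded, the ratio compares background-corrected
binding with peptide to background-corrected binding without peptide.

```python
def background_corrected(full_length_counts: float,
                         truncated_counts: float) -> float:
    """Counts bound to GST-Fas (191-335) minus counts bound to GST-Fas (191-320)."""
    return full_length_counts - truncated_counts


def percent_ratio_as_worded(with_peptide_full: float,
                            with_peptide_truncated: float,
                            without_peptide_full: float,
                            without_peptide_truncated: float) -> float:
    """Ratio from the text, scaled by 100 (the scaling is an assumption)."""
    numerator = background_corrected(with_peptide_full, with_peptide_truncated)
    denominator = background_corrected(without_peptide_full,
                                       without_peptide_truncated)
    if denominator == 0:
        raise ValueError("no background-corrected binding without peptide")
    return 100.0 * numerator / denominator
```

The arguments are radioactivity counts from the β counter, in whatever
unit the instrument reports, provided the same unit is used for all four
measurements.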



5. Interaction of the C-terminal 3 amino acids of Fas 
with FAP-1 in yeast and in vitro. 



The bait plasmids, pBTM116 (LexA)-SLV, -PLV, -SLY, and
-SLA, were constructed and transformed into the L40 strain
with pVP16-FAP-1 or -ras. Six independent clones from
each set of transformants were picked for the analysis of
growth on histidine-deficient medium. GST-Fas, -SLV, and
-PLV were purified with GST-Sepharose 4B affinity beads
(Pharmacia). The methods for in vitro binding are
described above.

6. Immuno-precipitation of native Fas with GST-FAP-1
and inhibition of Fas/FAP-1 binding with Ac-SLV.

GST-fusion proteins with or without FAP-1 were incubated
with cell extracts from Jurkat T-cells expressing Fas.
The bound Fas was detected by Western analysis using
anti-Fas monoclonal antibody (F22120, Transduction
Laboratories). The tripeptides Ac-SLV and Ac-SLY were
used for the inhibition assay of Fas/FAP-1 binding.



7. Microinjection of Ac-SLV into the DLD-1 cell line.

DLD-1 human colon cancer cells were cultured in RPMI 1640
medium containing 10% FCS. For microinjection, cells
were plated on CELLocate (Eppendorf) at 1 x 10^5 cells/2 ml
in a 35 mm plastic culture dish and grown for 1 day. Just
before microinjection, the Fas monoclonal antibody CH11
(MBL International) was added at a concentration of 500
ng/ml. All microinjection experiments were performed
using an automatic microinjection system (Eppendorf
transjector 5246, micro-manipulator 5171 and Femtotips)
(Pantel, et al. 1995). Synthetic tripeptides were
suspended in 0.1% (w/v) FITC-Dextran (Sigma)/K-PBS at a
concentration of 100 mM. The samples were microinjected
into the cytoplasmic region of DLD-1 cells. Sixteen to
20 hours postinjection, the cells were washed with PBS
and stained with 10 µg/ml Hoechst 33342 in PBS. After
incubation at 37 °C for 30 minutes, the cells were
photographed and the cells showing condensed chromatin
were counted as apoptotic.



8. Quantitation of apoptosis in microinjected DLD-1
cells.

For each experiment, 25-100 cells were microinjected.
Apoptosis of microinjected cells was determined by
assessing morphological changes of chromatin using phase
contrast and fluorescence microscopy (Wang, et al., 1995;
McGahon, et al., 1995). The data are means +/- S.D. for
two or three independent determinations.
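The summary statistic described above can be sketched as follows. This is
an illustration only: it assumes that each determination is reduced to the
percentage of microinjected cells scored as apoptotic, and that S.D. means
the sample standard deviation (n - 1 denominator); the text does not state
either point, and the function names are hypothetical.

```python
from statistics import mean, stdev


def percent_apoptotic(apoptotic_cells: int, injected_cells: int) -> float:
    """Percentage of microinjected cells scored as apoptotic."""
    if injected_cells <= 0:
        raise ValueError("at least one microinjected cell is required")
    return 100.0 * apoptotic_cells / injected_cells


def summarize(determinations: list[tuple[int, int]]) -> tuple[float, float]:
    """Mean and sample S.D. over (apoptotic, injected) counts per determination."""
    percentages = [percent_apoptotic(a, n) for a, n in determinations]
    # With a single determination there is no spread to report.
    spread = stdev(percentages) if len(percentages) > 1 else 0.0
    return mean(percentages), spread
```

Each tuple passed to `summarize` holds the counts from one independent
determination, so two or three tuples correspond to the two or three
determinations mentioned in the text.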

Discussion 

In order to identify the minimal peptide stretch in the
C-terminal region of the Fas receptor necessary for FAP-1
binding, an in vitro inhibition assay of Fas/FAP-1
binding was performed using a series of synthetic peptides,
as well as yeast two-hybrid system peptide libraries (Figure
2A). First, semi-random libraries (based on the homology
between human and rat Fas) (Figures 2B and 2C) of 15
amino acids fused to a LexA DNA binding domain were
constructed and co-transformed into yeast strain L40 with
pVP16-31 (Sato, et al. 1995) that was originally isolated
as FAP-1. After the selection of 200 His+ colonies from an
initial screen of 5.0 x 10^6 (Johnson, et al. 1986)
transformants, 100 colonies that were β-galactosidase
positive were picked for further analysis. Sequence
analysis of the library plasmids encoding the C-terminal
15 amino acids revealed that all of the C-termini were
either valine, leucine or isoleucine residues. Second,
a random library of 4-15 amino acids fused to a LexA DNA
binding domain was constructed and screened according to
this strategy (Figure 2D). Surprisingly, all of the third
amino acid residues from the C-termini were serine, and
the results of C-terminal amino acid analyses were
identical to the screening of the semi-random cDNA
libraries. No other significant amino acid sequences were
found in these library screenings, suggesting that the
motifs of the last three amino acids (tS-X-V/L/I) are
very important for the association with the third PDZ
domain of FAP-1 and play a crucial role in
protein-protein interaction as well as for the regulation
of Fas-induced apoptosis. To further confirm whether the
last three amino acids are necessary and sufficient for
Fas/FAP-1 binding, plasmids of the LexA-SLV, -PLV, -PLY,
-SLY, and -SLA fusion proteins were constructed and
co-transformed into yeast with pVP16-FAP-1. The results
showed that only LexA-SLV associated with FAP-1, whereas
LexA-PLV, -PLY, -SLY, and -SLA did not (Figure 4A). In
vitro binding studies using various GST-tripeptide
fusions and in vitro-translated FAP-1 were consistent
with these results (Figure 4B).

In addition to yeast two-hybrid approaches, an in vitro
inhibition assay of Fas/FAP-1 binding was also used.
First, a synthetic peptide of the C-terminal 15 amino
acids was tested for its ability to inhibit the binding of
Fas and FAP-1 in vitro (Figure 3A). The binding of in
vitro-translated FAP-1 to GST-Fas was dramatically
reduced and dependent on the concentration of the
synthetic 15 amino acids of Fas. In contrast with these
results, human PAMP peptide (Kitamura, et al. 1994) as a
negative control had no effect on Fas/FAP-1 binding
activity under the same biochemical conditions. Second,
the effect of truncated C-terminal synthetic peptides of
Fas on Fas/FAP-1 binding in vitro was examined. As shown
in Figure 3B, the three C-terminal amino acids alone
(Ac-SLV) were sufficient to obtain the same level of
inhibitory effect on the binding of FAP-1 to Fas as
achieved with the 4-15 amino acid synthetic peptides.
Furthermore, Fas/FAP-1 binding was extensively investigated
using the scanned tripeptides to determine the critical
amino acid residues required for inhibition (Figure 3C). The
results revealed that the third amino acid residues from
the C-terminus, and the C-terminal amino acids, having the
strongest inhibitory effect were either serine or
threonine, and either valine, leucine, or isoleucine,
respectively. However, there were no differences among
the second amino acid residues from the C-terminus with
respect to their inhibitory effect on Fas/FAP-1 binding.
These results were consistent with those of the yeast
two-hybrid system (Figures 2C and 2D). Therefore, it was
concluded that the C-terminal three amino acids (SLV) are
critical determinants of Fas binding to the third PDZ
domain of FAP-1 protein.



To further substantiate that the PDZ domain interacts
with tS/T-X-V/L/I under more native conditions, GST-fused
FAP-1 proteins were tested for their ability to interact
with Fas expressed in Jurkat T-cells. The results
revealed that the tripeptide Ac-SLV, but not Ac-SLY,
abolished in a dose-dependent manner the binding activity
of FAP-1 to Fas proteins extracted from Jurkat T-cells
(Figures 4C and 4D). This suggests that the C-terminal
amino acids tSLV are the minimum binding site for FAP-1,
and that the amino acids serine and valine are critical
for this physical association.

To next examine the hypothesis that the physiological
association between the C-terminal three amino acids of
Fas and the third PDZ domain of FAP-1 is necessary for
the in vivo function of FAP-1 as a negative regulator of
Fas-mediated signal transduction, a microinjection
experiment was employed with synthetic tripeptides in a
colon cancer cell line, DLD-1, which expresses both Fas
and FAP-1, and is resistant to Fas-induced apoptosis.
The experiments involved the direct microinjection of the
synthetic tripeptides into the cytoplasmic regions of
single cells and the monitoring of the physiological
response to Fas-induced apoptosis in vivo. The results
showed that microinjection of Ac-SLV into DLD-1 cells
dramatically induced apoptosis in the presence of
Fas-monoclonal antibodies (CH11, 500 ng/ml) (Figures 5A,
5E and Figure 6), but that microinjection of Ac-SLY and
PBS/K did not (Figures 5B, 5F and Figure 6). These
results strongly support the hypothesis that the physical
association of FAP-1 with the C-terminus of Fas is
essential for protecting cells from Fas-induced
apoptosis.

In summary, it was found that the C-terminal SLV of Fas
is alone necessary and sufficient for binding to the
third PDZ domain of FAP-1. Secondly, a new consensus
motif, tS/T-X-V/L/I, is proposed for such binding
to the PDZ domain, instead of tS/T-X-V. It is therefore
possible that FAP-1 plays important roles for the
modulation of signal transduction pathways in addition to
its physical interaction with Fas. Thirdly, the targeted
induction of Fas-mediated apoptosis in colon cancer cells
by direct microinjection of the tripeptide Ac-SLV is
demonstrated. Further investigations
including the identification of a substrate(s) of FAP-1
and structure-function analysis will provide insight into
the potential therapeutic applications of Fas/FAP-1
interaction in cancer as well as provide a better
understanding of the inhibitory effect of FAP-1 on
Fas-mediated signal transduction.
SECOND SERIES OF EXPERIMENTS



FAP-1 was originally identified as a membrane-associated
protein tyrosine phosphatase which binds to the
C-terminus of Fas, and possesses six PDZ domains (also
known as DHR domain or GLGF repeat). The PDZ domain has
recently been shown to be a novel module for specific
protein-protein interaction, and it appears to be
important in the assembly of membrane proteins and also
in linking signaling molecules in a multiprotein complex.
In recent comprehensive studies, it was found that the
third PDZ domain of FAP-1 specifically recognized the
sequence motif t(S/T)-X-V and interacts with the
C-terminal three amino acids SLV of Fas (Fig. 9). In order
to investigate the possibility that FAP-1 also interacts
with the C-terminal region of p75NGFR (Fig. 8), an in
vitro binding assay was performed, as well as a yeast
two-hybrid analysis, using a series of deletion mutants
of p75NGFR. The results revealed that the C-terminal
cytoplasmic region of p75NGFR, which is highly conserved
among all species, interacts with FAP-1 (Fig. 10).
Furthermore, the C-terminal three amino acids SPV of
p75NGFR were necessary and sufficient for the interaction
with the third PDZ domain of FAP-1 (Fig. 11A and 11B).
Since FAP-1 expression was found highest in fetal brain,
these findings imply that interaction of FAP-1 with
p75NGFR plays an important role for the signal transduction
pathway via p75NGFR in neuronal cells as well as in the
formation of the initial signal-transducing complex for
p75NGFR.
SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Takaaki Sato and Junn Yanagisawa

(ii) TITLE OF INVENTION: COMPOUNDS THAT INHIBIT THE
INTERACTION BETWEEN SIGNAL-TRANSDUCING PROTEINS AND THE
GLGF (PDZ/DHR) DOMAIN AND USES THEREOF

(iii) NUMBER OF SEQUENCES: 33

(iv) CORRESPONDENCE ADDRESS:
(A) ADDRESSEE: Cooper & Dunham LLP
(B) STREET: 1185 Avenue of the Americas
(C) CITY: New York
(D) STATE: New York
(E) COUNTRY: U.S.A.
(F) ZIP: 10036

(v) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:
(A) APPLICATION NUMBER: Not Yet Known
(B) FILING DATE: 18-JUL-1997
(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:
(A) NAME: White, John P
(B) REGISTRATION NUMBER: 28,678
(C) REFERENCE/DOCKET NUMBER: 0575/48962-A-PCT/JPW/JKM

(ix) TELECOMMUNICATION INFORMATION:
(A) TELEPHONE: (212) 278-0400
(B) TELEFAX: (212) 391-0525

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Gly/Ser/Ala/Glu Leu Gly Phe/Ile/Leu
1

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Lys/Arg/Gln Xaa (n) Gly/Ser/Ala/Glu Leu Gly Phe/Ile/Leu
1 5

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Ser Leu Gly Ile
1

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Ser/Thr Xaa Val/Ile/Leu
1

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu Ile Gln Ser Leu Val
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Ser Ile Ser Asn Ser Arg Asn Glu Asn Glu Gly Gln Ser Leu Glu
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Ser Thr Pro Asp Thr Gly Asn Glu Asn Glu Gly Gln Cys Leu Glu
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Glu Ser Leu Val
1



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Thr Ile Gln Ser Val Ile
1               5

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Arg Gly Phe Ile Ser Ser Leu Val
1               5

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Arg Glu Thr Ile Glu Ser Thr Val
1               5

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 11 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Gln Asn Phe Arg Thr Tyr Ile Val Ser Phe Val
1               5                   10

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 13 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Ser Asp Ser Asn Met Asn Met Asn Glu Leu Ser Glu Val
1               5                   10

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

Pro Pro Thr Cys Ser Gln Ala Asn Ser Gly Arg Ile Ser Thr Leu
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

Ile Asp Leu Ala Ser Glu Phe Leu Phe Leu Ser Asn Ser Phe Leu
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Asp Ser Glu Met Tyr Asn Phe Arg Ser Gln Leu Ala Ser Val Val
1               5                   10                  15

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Ile Pro Pro Asp Ser Glu Asp Gly Asn Glu Glu Gln Ser Leu Val
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 4 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

Gln Ser Leu Val
1

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 5 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Ile Gln Ser Leu Val
1               5

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 6 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

Glu Ile Gln Ser Leu Val
1               5

(2) INFORMATION FOR SEQ ID NO: 21:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 7 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Asn Glu Ile Gln Ser Leu Val
1               5



(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 8 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

Arg Asn Glu Ile Gln Ser Leu Val
1               5

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 15 amino acids
(B) TYPE: amino acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

Asp Ser Glu Asn Ser Asn Phe Arg Asn Glu Ile Gln Ser Leu Val
1               5                   10                  15



(2) INFORMATION FOR SEQ ID NO: 24; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 amino acids 
25 (B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 

Met Gly Ala Gly Ala Thr Gly Arg Ala Met Asp Gly Pro Arg Leu Leu
1 5 10 15

Leu Leu Leu Leu Leu Gly Val Ser Leu Gly Gly Ala Lys Glu Ala Cys
20 25 30

Pro Thr Gly Leu Tyr Thr His Ser Gly Glu Cys Cys Lys Ala Cys Asn
35 40 45

Leu Gly Glu Gly Val Ala Gln Pro Cys Gly Ala Asn Gln Thr Val Cys
50 55 60

Glu Pro Cys Leu Asp Ser Val Thr Phe Ser Asp Val Val Ser Ala Thr
65 70 75 80

Glu Pro Cys Lys Pro Cys Thr Glu Cys Val Gly Leu Gln Ser Met Ser
85 90 95

Ala Pro Cys Val Glu Ala Asp Asp Ala Val Cys Arg Cys Ala Tyr Gly
100 105 110

Tyr Tyr Gln Asp Glu Thr Thr Gly Arg Cys Glu Ala Cys Arg Val Cys
115 120 125

Glu Ala Gly Ser Gly Leu Val Phe Ser Cys Gln Asp Lys Gln Asn Thr
130 135 140

Val Cys Glu Glu Cys Pro Asp Gly Thr Tyr Ser Asp Glu Ala Asn His
145 150 155 160

Val Asp Pro Cys Leu Pro Cys Thr Val Cys Glu Asp Thr Glu Arg Gln
165 170 175

Leu Arg Glu Cys Thr Arg Trp Ala Asp Ala Glu Cys Glu Glu Ile Pro
180 185 190

Gly Arg Trp Ile Thr Arg Ser Thr Pro Pro Glu Gly Ser Asp Ser Thr
195 200 205

Ala Pro Ser Thr Gln Glu Pro Glu Ala Pro Pro Glu Gln Asp Leu Ile
210 215 220

Ala Ser Thr Val Ala Gly Val Val Thr Thr Val Met Gly Ser Ser Gln
225 230 235 240

Pro Val Val Thr Arg Gly Thr Thr Asp Asn Leu Ile Pro Val Tyr Cys
245 250 255

Ser Ile Leu Ala Ala Val Val Val Gly Leu Val Ala Tyr Ile Ala Phe
260 265 270

Lys Arg Trp Asn Ser Cys Lys Gln Asn Lys Gly Gly Ala Asn Ser Arg
275 280 285

Pro Val Asn Gln Thr Pro Pro Pro Glu Gly Glu Lys Ile His Ser Asp
290 295 300

Ser Gly Ile Ser Val Asp Ser Gln Ser Leu His Asp Gln Gln Pro His
305 310 315 320

Thr Gln Thr Ala Ser Gly Gln Ala Leu Lys Gly Asp Gly Gly Leu Tyr
325 330 335

Ser Ser Leu Pro Pro Ala Lys Arg Glu Glu Val Glu Lys Leu Leu Asn
340 345 350

Gly Ser Ala Gly Asp Thr Trp Arg His Leu Ala Gly Glu Leu Gly Tyr
355 360 365

Gln Pro Glu His Ile Asp Ser Phe Thr His Glu Ala Cys Pro Val Arg
370 375 380

Ala Leu Leu Ala Ser Trp Ala Thr Gln Asp Ser Ala Thr Leu Asp Ala
385 390 395 400

Leu Leu Ala Ala Leu Arg Arg Ile Gln Arg Ala Asp Leu Val Glu Ser
405 410 415

Leu Cys Ser Glu Ser Thr Ala Thr Ser Pro Val
420 425



(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 458 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Asn Arg Gly Val Pro Phe Arg His Leu Leu Leu Val Leu Gln Leu
1 5 10 15

Ala Leu Leu Pro Ala Ala Thr Gln Gly Lys Lys Val Val Leu Gly Lys
20 25 30

Lys Gly Asp Thr Val Glu Leu Thr Cys Thr Ala Ser Gln Lys Lys Ser
35 40 45

Ile Gln Phe His Trp Lys Asn Ser Asn Gln Ile Lys Ile Leu Gly Asn
50 55 60

Gln Gly Ser Phe Leu Thr Lys Gly Pro Ser Lys Leu Asn Asp Arg Ala
65 70 75 80

Asp Ser Arg Arg Ser Leu Trp Asp Gln Gly Asn Phe Pro Leu Ile Ile
85 90 95

Lys Asn Leu Lys Ile Glu Asp Ser Asp Thr Tyr Ile Cys Glu Val Glu
100 105 110

Asp Gln Lys Glu Glu Val Gln Leu Leu Val Phe Gly Leu Thr Ala Asn
115 120 125

Ser Asp Thr His Leu Leu Gln Gly Gln Ser Leu Thr Ile Thr Leu Glu
130 135 140

Ser Pro Pro Gly Ser Ser Pro Ser Val Gln Cys Arg Ser Pro Arg Gly
145 150 155 160

Lys Asn Ile Gln Gly Gly Lys Thr Leu Ser Val Ser Gln Leu Glu Leu
165 170 175

Gln Asp Ser Gly Thr Trp Thr Cys Thr Val Leu Gln Asn Gln Lys Lys
180 185 190

Val Glu Phe Lys Ile Asp Ile Val Val Leu Ala Phe Gln Lys Ala Ser
195 200 205

Ser Ile Val Tyr Lys Lys Glu Gly Glu Gln Val Glu Phe Ser Phe Pro
210 215 220

Leu Ala Phe Thr Val Glu Lys Leu Thr Gly Ser Gly Glu Leu Trp Trp
225 230 235 240

Gln Ala Glu Arg Ala Ser Ser Ser Lys Ser Trp Ile Thr Phe Asp Leu
245 250 255

Lys Asn Lys Glu Val Ser Val Lys Arg Val Thr Gln Asp Pro Lys Leu
260 265 270

Gln Met Gly Lys Lys Leu Pro Leu His Leu Thr Leu Pro Gln Ala Leu
275 280 285

Pro Gln Tyr Ala Gly Ser Gly Asn Leu Thr Leu Ala Leu Glu Ala Lys
290 295 300

Thr Gly Lys Leu His Gln Glu Asn Val Leu Val Val Met Arg Ala Thr
305 310 315 320

Gln Leu Gln Lys Asn Leu Thr Cys Glu Val Trp Gly Pro Thr Ser Pro
325 330 335

Lys Leu Met Leu Ser Leu Lys Leu Glu Asn Lys Glu Ala Lys Val Ser
340 345 350

Lys Arg Glu Lys Ala Val Trp Val Leu Asn Pro Glu Ala Gly Met Trp
355 360 365

Gln Cys Leu Leu Ser Asp Ser Gly Gln Val Leu Leu Glu Ser Asn Ile
370 375 380

Lys Val Leu Pro Thr Trp Ser Thr Pro Val Gln Pro Met Ala Leu Ile
385 390 395 400

Val Leu Gly Gly Val Ala Gly Leu Leu Leu Phe Ile Gly Leu Gly Ile
405 410 415

Phe Phe Cys Val Arg Cys Arg His Arg Arg Arg Gln Ala Glu Arg Met
420 425 430

Ser Gln Ile Lys Arg Leu Leu Ser Glu Lys Lys Glu Cys Gln Cys Pro
435 440 445

His Arg Phe Gln Lys Thr Cys Ser Pro Ile
450 455

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

Met Asn Ser Gly Val Ala Met Lys Tyr Gly Asn Asp Ser Ser Ala Glu
1 5 10 15

Leu Ser Glu Leu His Ser Ala Ala Leu Ala Ser Leu Lys Gly Asp Ile
20 25 30

Val Glu Leu Asn Lys Arg Leu Gln Gln Thr Glu Arg Glu Asp Leu Leu
35 40 45

Glu Lys Lys Leu Ala Lys Ala Gln Cys Glu Gln Ser His Leu Met Arg
50 55 60

Glu His Glu Asp Val Gln Glu Arg Thr Thr Leu Arg Tyr Glu Glu Arg
65 70 75 80

Ile Thr Glu Leu His Ser Val Ile Ala Glu Leu Asn Lys Lys Ile Asp
85 90 95

Arg Leu Gln Gly Thr Thr Ile Arg Glu Glu Asp Glu Tyr Ser Glu Leu
100 105 110

Arg Ser Glu Leu Ser Gln Ser Gln His Glu Val Asn Glu Asp Ser Arg
115 120 125

Ser Met Asp Gln Asp Gln Thr Ser Val Ser Ile Pro Glu Asn Gln Ser
130 135 140

Thr Met Val Thr Ala Asp Met Asp Asn Cys Ser Asp Ile Asn Ser Glu
145 150 155 160

Leu Gln Arg Val Leu Thr Gly Leu Glu Asn Val Val Cys Gly Arg Lys
165 170 175

Lys Ser Ser Cys Ser Leu Ser Val Ala Glu Val Asp Arg His Ile Glu
180 185 190

Gln Leu Thr Thr Ala Ser Glu His Cys Asp Leu Ala Ile Lys Thr Val
195 200 205

Glu Glu Ile Glu Gly Val Leu Gly Arg Asp Leu Tyr Pro Asn Leu Ala
210 215 220

Glu Glu Arg Ser Arg Trp Glu Lys Glu Leu Ala Gly Leu Arg Glu Glu
225 230 235 240

Asn Glu Ser Leu Thr Ala Met Leu Cys Ser Lys Glu Glu Glu Leu Asn
245 250 255

Arg Thr Lys Ala Thr Met Asn Ala Ile Arg Glu Glu Arg Asp Arg Leu
260 265 270

Arg Arg Arg Val Arg Glu Leu Gln Thr Arg Leu Gln Ser Val Gln Ala
275 280 285

Thr Gly Pro Ser Ser Pro Gly Arg Leu Thr Ser Thr Asn Arg Pro Ile
290 295 300

Asn Pro Ser Thr Gly Glu Leu Ser Thr Ser Ser Ser Ser Asn Asp Ile
305 310 315 320

Pro Ile Ala Lys Ile Ala Glu Arg Val Lys Leu Ser Lys Thr Arg Ser
325 330 335

Glu Ser Ser Ser Ser Asp Arg Pro Val Leu Gly Ser Glu Ile Ser Ser
340 345 350

Ile Gly Val Ser Ser Ser Val Ala Glu His Leu Ala His Ser Leu Gln
355 360 365

Asp Cys Ser Asn Ile Gln Glu Ile Phe Gln Thr Leu Tyr Ser His Gly
370 375 380

Ser Ala Ile Ser Glu Ser Lys Ile Arg Glu Phe Glu Val Glu Thr Glu
385 390 395 400

Arg Leu Asn Ser Arg Ile Glu His Leu Lys Ser Gln Asn Asp Leu Leu
405 410 415

Thr Ile Thr Leu Glu Glu Cys Lys Ser Asn Ala Glu Arg Met Ser Met
420 425 430

Leu Val Gly Lys Tyr Glu Ser Asn Ala Thr Ala Leu Arg Leu Ala Leu
435 440 445

Gln Tyr Ser Glu Gln Cys Ile Glu Ala Tyr Glu Leu Leu Leu Ala Leu
450 455 460

Ala Glu Ser Glu Gln Ser Leu Ile Leu Gly Gln Phe Arg Ala Ala Gly
465 470 475 480

Val Gly Ser Ser Pro Gly Asp Gln Ser Gly Asp Glu Asn Ile Thr Gln
485 490 495

Met Leu Lys Arg Ala His Asp Cys Arg Lys Thr Ala Glu Asn Ala Ala
500 505 510

Lys Ala Leu Leu Met Lys Leu Asp Gly Ser Cys Gly Gly Ala Phe Ala
515 520 525

Val Ala Gly Cys Ser Val Gln Pro Trp Glu Ser Leu Ser Ser Asn Ser
530 535 540

His Thr Ser Thr Thr Ser Ser Thr Ala Ser Ser Cys Asp Thr Glu Phe
545 550 555 560

Thr Lys Glu Asp Glu Gln Arg Leu Lys Asp Tyr Ile Gln Gln Leu Lys
565 570 575

Asn Asp Arg Ala Ala Val Lys Leu Thr Met Leu Glu Leu Glu Ser Ile
580 585 590

His Ile Asp Pro Leu Ser Tyr Asp Val Lys Pro Arg Gly Asp Ser Gln
595 600 605

Arg Leu Asp Leu Glu Asn Ala Val Leu Met Gln Glu Leu Met Ala Met
610 615 620

Lys Glu Glu Met Ala Glu Leu Lys Ala Gln Leu Tyr Leu Leu Glu Lys
625 630 635 640

Glu Lys Lys Ala Leu Glu Leu Lys Leu Ser Thr Arg Glu Ala Gln Glu
645 650 655

Gln Ala Tyr Leu Val His Ile Glu His Leu Lys Ser Glu Val Glu Glu
660 665 670

Gln Lys Glu Gln Arg Met Arg Ser Leu Ser Ser Thr Ser Ser Gly Ser
675 680 685

Lys Asp Lys Pro Gly Lys Glu Cys Ala Asp Ala Ala Ser Pro Ala Leu
690 695 700

Ser Leu Ala Glu Leu Arg Thr Thr Cys Ser Glu Asn Glu Leu Ala Ala
705 710 715 720

Glu Phe Thr Asn Ala Ile Arg Arg Glu Lys Lys Leu Lys Ala Arg Val
725 730 735

Gln Glu Leu Val Ser Ala Leu Glu Arg Leu Thr Lys Ser Ser Glu Ile
740 745 750

Arg His Gln Gln Ser Ala Glu Phe Val Asn Asp Leu Lys Arg Ala Asn
755 760 765

Ser Asn Leu Val Ala Ala Tyr Glu Lys Ala Lys Lys Lys His Gln Asn
770 775 780

Lys Leu Lys Lys Leu Glu Ser Gln Met Met Ala Met Val Glu Arg His
785 790 795 800

Glu Thr Gln Val Arg Met Leu Lys Gln Arg Ile Ala Leu Leu Glu Glu
805 810 815

Glu Asn Ser Arg Pro His Thr Asn Glu Thr Ser Leu
820 825



(2) INFORMATION FOR SEQ ID NO: 27:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 672 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Met Ala Asp Val Phe Pro Gly Asn Asp Ser Thr Ala Ser Gln Asp Val
1 5 10 15

Ala Asn Arg Phe Ala Arg Lys Gly Ala Leu Arg Gln Lys Asn Val His
20 25 30

Glu Val Lys Asp His Lys Phe Ile Ala Arg Phe Phe Lys Gln Pro Thr
35 40 45

Phe Cys Ser His Cys Thr Asp Phe Ile Trp Gly Phe Gly Lys Gly Gly
50 55 60

Phe Gln Cys Gln Val Cys Cys Phe Val Val His Lys Arg Cys His Glu
65 70 75 80

Phe Val Thr Phe Ser Cys Pro Gly Ala Asp Lys Gly Pro Asp Thr Asp
85 90 95

Asp Pro Arg Ser Lys His Lys Phe Lys Ile His Thr Tyr Gly Ser Pro
100 105 110

Thr Phe Cys Asp His Cys Gly Ser Leu Leu Tyr Gly Leu Ile His Gln
115 120 125

Gly Met Lys Cys Asp Thr Cys Asp Met Asn Val His Lys Gln Cys Val
130 135 140

Ile Asn Val Pro Ser Leu Cys Gly Met Asp His Thr Glu Lys Arg Gly
145 150 155 160

Arg Ile Tyr Leu Lys Ala Glu Val Ala Asp Glu Lys Leu His Val Thr
165 170 175

Val Arg Asp Ala Lys Asn Leu Ile Pro Met Asp Pro Asn Gly Leu Ser
180 185 190

Asp Pro Tyr Val Lys Leu Lys Leu Ile Pro Asp Pro Lys Asn Glu Ser
195 200 205

Lys Gln Lys Thr Lys Thr Ile Arg Ser Thr Leu Asn Pro Gln Trp Asn
210 215 220

Glu Ser Phe Thr Phe Lys Leu Lys Pro Ser Asp Lys Asp Arg Arg Leu
225 230 235 240

Ser Val Glu Ile Trp Asp Trp Asp Arg Thr Thr Arg Asn Asp Phe Met
245 250 255

Gly Ser Leu Ser Phe Gly Val Ser Glu Leu Met Lys Met Pro Ala Ser
260 265 270

Gly Trp Tyr Lys Leu Leu Asn Gln Glu Glu Gly Glu Tyr Tyr Asn Val
275 280 285

Pro Ile Pro Glu Gly Asp Glu Glu Gly Asn Met Glu Leu Arg Gln Lys
290 295 300

Phe Glu Lys Ala Lys Leu Gly Pro Ala Gly Asn Lys Val Ile Ser Pro
305 310 315 320

Ser Glu Asp Arg Lys Gln Pro Ser Asn Asn Leu Asp Arg Val Lys Leu
325 330 335

Thr Asp Phe Asn Phe Leu Met Val Leu Gly Lys Gly Ser Phe Gly Lys
340 345 350

Val Met Leu Ala Asp Arg Lys Gly Thr Glu Glu Leu Tyr Ala Ile Lys
355 360 365

Ile Leu Lys Lys Asp Val Val Ile Gln Asp Asp Asp Val Glu Cys Thr
370 375 380

Met Val Glu Lys Arg Val Leu Ala Leu Leu Asp Lys Pro Pro Phe Leu
385 390 395 400

Thr Gln Leu His Ser Cys Phe Gln Thr Val Asp Arg Leu Tyr Phe Val
405 410 415

Met Glu Tyr Val Asn Gly Gly Asp Leu Met Tyr His Ile Gln Gln Val
420 425 430

Gly Lys Phe Lys Glu Pro Gln Ala Val Phe Tyr Ala Ala Glu Ile Ser
435 440 445

Ile Gly Leu Phe Phe Leu His Lys Arg Gly Ile Ile Tyr Arg Asp Leu
450 455 460

Lys Leu Asp Asn Val Met Leu Asp Ser Glu Gly His Ile Lys Ile Ala
465 470 475 480

Asp Phe Gly Met Cys Lys Glu His Met Met Asp Gly Val Thr Thr Arg
485 490 495

Thr Phe Cys Gly Thr Pro Asp Tyr Ile Ala Pro Glu Ile Ile Ala Tyr
500 505 510

Gln Pro Tyr Gly Lys Ser Val Asp Trp Trp Ala Tyr Gly Val Leu Leu
515 520 525

Tyr Glu Met Leu Ala Gly Gln Pro Pro Phe Asp Gly Glu Asp Glu Asp
530 535 540

Glu Leu Phe Gln Ser Ile Met Glu His Asn Val Ser Tyr Pro Lys Ser
545 550 555 560

Leu Ser Lys Glu Ala Val Ser Ile Cys Lys Gly Leu Met Thr Lys His
565 570 575

Pro Ala Lys Arg Leu Gly Cys Gly Pro Glu Gly Glu Arg Asp Val Arg
580 585 590

Glu His Ala Phe Phe Arg Arg Ile Asp Trp Glu Lys Leu Glu Asn Arg
595 600 605

Glu Ile Gln Pro Pro Phe Lys Pro Lys Val Cys Gly Lys Gly Ala Glu
610 615 620

Asn Phe Asp Lys Phe Phe Thr Arg Gly Gln Pro Val Leu Thr Pro Pro
625 630 635 640

Asp Gln Leu Val Ile Ala Asn Ile Asp Gln Ser Asp Phe Glu Gly Phe
645 650 655

Ser Tyr Val Asn Pro Gln Phe Val His Pro Ile Leu Gln Ser Ala Val
660 665 670



(2) INFORMATION FOR SEQ ID NO: 28:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 471 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

Met Asp Ile Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn
1 5 10 15

Ser Leu Met Gln Leu Asn Asp Asp Thr Arg Leu Tyr Ser Asn Asp Phe
20 25 30

Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp
35 40 45

Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser
50 55 60

Cys Leu Ser Leu Leu His Leu Gln Glu Lys Asn Trp Ser Ala Leu Leu
65 70 75 80

Thr Ala Val Val Ile Ile Leu Thr Ile Ala Gly Asn Ile Leu Val Ile
85 90 95

Met Ala Val Ser Leu Glu Lys Lys Leu Gln Asn Ala Thr Asn Tyr Phe
100 105 110

Leu Met Ser Leu Ala Ile Ala Asp Met Leu Leu Gly Phe Leu Val Met
115 120 125

Pro Val Ser Met Leu Thr Ile Leu Tyr Gly Tyr Arg Trp Pro Leu Pro
130 135 140

Ser Lys Leu Cys Ala Val Trp Ile Tyr Leu Asp Val Leu Phe Ser Thr
145 150 155 160

Ala Ser Ile Met His Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala
165 170 175

Ile Gln Asn Pro Ile His His Ser Arg Phe Asn Ser Arg Thr Lys Ala
180 185 190

Phe Leu Lys Ile Ile Ala Val Trp Thr Ile Ser Val Gly Ile Ser Met
195 200 205

Pro Ile Pro Val Phe Gly Leu Gln Asp Asp Ser Lys Val Phe Lys Glu
210 215 220

Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu Ile Gly Ser Phe
225 230 235 240

Val Ser Phe Phe Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Phe Leu
245 250 255

Thr Ile Lys Ser Leu Gln Lys Glu Ala Thr Leu Cys Val Ser Asp Leu
260 265 270

Gly Thr Arg Ala Lys Leu Ala Ser Phe Ser Phe Leu Pro Gln Ser Ser
275 280 285

Leu Ser Ser Glu Lys Leu Phe Gln Arg Ser Ile His Arg Glu Pro Gly
290 295 300

Ser Tyr Thr Gly Arg Arg Thr Met Gln Ser Ile Ser Asn Glu Gln Lys
305 310 315 320

Ala Cys Lys Val Leu Gly Ile Val Phe Phe Leu Phe Val Val Met Trp
325 330 335

Cys Pro Phe Phe Ile Thr Asn Ile Met Ala Val Ile Cys Lys Glu Ser
340 345 350

Cys Asn Glu Asp Val Ile Gly Ala Leu Leu Asn Val Phe Val Trp Ile
355 360 365

Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu Val Tyr Thr Leu Phe Asn
370 375 380

Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr Ile Gln Cys Gln Tyr Lys
385 390 395 400

Glu Asn Lys Lys Pro Leu Gln Leu Ile Leu Val Asn Thr Ile Pro Ala
405 410 415

Leu Ala Tyr Lys Ser Ser Gln Leu Gln Met Gly Gln Lys Lys Asn Ser
420 425 430

Lys Gln Asp Ala Lys Thr Thr Asp Asn Asp Cys Ser Met Val Ala Leu
435 440 445

Gly Lys Gln His Ser Glu Glu Ala Ser Lys Asp Asn Ser Asp Gly Val
450 455 460

Asn Glu Lys Val Ser Cys Val
465 470



(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 481 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

Met Ala Leu Ser Tyr Arg Val Ser Glu Leu Gln Ser Thr Ile Pro Glu
1 5 10 15

His Ile Leu Gln Ser Thr Phe Val His Val Ile Ser Ser Asn Trp Ser
20 25 30

Gly Leu Gln Thr Glu Ser Ile Pro Glu Glu Met Lys Gln Ile Val Glu
35 40 45

Glu Gln Gly Asn Lys Leu His Trp Ala Ala Leu Leu Ile Leu Met Val
50 55 60

Ile Ile Pro Thr Ile Gly Gly Asn Thr Leu Val Ile Leu Ala Val Ser
65 70 75 80

Leu Glu Lys Lys Leu Gln Tyr Ala Thr Asn Tyr Phe Leu Met Ser Leu
85 90 95

Ala Val Ala Asp Leu Leu Val Gly Leu Phe Val Met Pro Ile Ala Leu
100 105 110

Leu Thr Ile Met Phe Glu Ala Met Trp Pro Leu Pro Leu Val Leu Cys
115 120 125

Pro Ala Trp Leu Phe Leu Asp Val Leu Phe Ser Thr Ala Ser Ile Met
130 135 140

His Leu Cys Ala Ile Ser Val Asp Arg Tyr Ile Ala Ile Lys Lys Pro
145 150 155 160

Ile Gln Ala Asn Gln Tyr Asn Ser Arg Ala Thr Ala Phe Ile Lys Ile
165 170 175

Thr Val Val Trp Leu Ile Ser Ile Gly Ile Ala Ile Pro Val Pro Ile
180 185 190

Lys Gly Ile Glu Thr Asp Val Asp Asn Pro Asn Asn Ile Thr Cys Val
195 200 205

Leu Thr Lys Glu Arg Phe Gly Asp Phe Met Leu Phe Gly Ser Leu Ala
210 215 220

Ala Phe Phe Thr Pro Leu Ala Ile Met Ile Val Thr Tyr Phe Leu Thr
225 230 235 240

Ile His Ala Leu Gln Lys Lys Ala Tyr Leu Val Lys Asn Lys Pro Pro
245 250 255

Gln Arg Leu Thr Trp Leu Thr Val Ser Thr Val Phe Gln Arg Asp Glu
260 265 270

Thr Pro Cys Ser Ser Pro Glu Lys Val Ala Met Leu Asp Gly Ser Arg
275 280 285

Lys Asp Lys Ala Leu Pro Asn Ser Gly Asp Glu Thr Leu Met Arg Arg
290 295 300

Thr Ser Thr Ile Gly Lys Lys Ser Val Gln Thr Ile Ser Asn Glu Gln
305 310 315 320

Arg Ala Ser Lys Val Leu Gly Ile Val Phe Phe Leu Phe Leu Leu Met
325 330 335

Trp Cys Pro Phe Phe Ile Thr Asn Ile Thr Leu Val Leu Cys Asp Ser
340 345 350

Cys Asn Gln Thr Thr Leu Gln Met Leu Leu Glu Ile Phe Val Trp Ile
355 360 365

Gly Tyr Val Ser Ser Gly Val Asn Pro Leu Val Tyr Thr Leu Phe Asn
370 375 380

Lys Thr Phe Arg Asp Ala Phe Gly Arg Tyr Ile Thr Cys Asn Tyr Arg
385 390 395 400

Ala Thr Lys Ser Val Lys Thr Leu Arg Lys Arg Ser Ser Lys Ile Tyr
405 410 415

Phe Arg Asn Pro Met Ala Glu Asn Ser Lys Phe Phe Lys Lys His Gly
420 425 430

Ile Arg Asn Gly Ile Asn Pro Ala Met Tyr Gln Ser Pro Met Arg Leu
435 440 445

Arg Ser Ser Thr Ile Gln Ser Ser Ser Ile Ile Leu Leu Asp Thr Leu
450 455 460

Leu Leu Thr Glu Asn Glu Gly Asp Lys Thr Glu Glu Gln Val Ser Val
465 470 475 480

Val

(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2843 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



WO 98/05347 PCT/US97/12677 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Met Ala Ala Ala Ser Tyr Asp Gln Leu Leu Lys Gln Val Glu Ala Leu
1 5 10 15

Lys Met Glu Asn Ser Asn Leu Arg Gln Glu Leu Glu Asp Asn Ser Asn
20 25 30

His Leu Thr Lys Leu Glu Thr Glu Ala Ser Asn Met Lys Glu Val Leu
35 40 45

Lys Gln Leu Gln Gly Ser Ile Glu Asp Glu Ala Met Ala Ser Ser Gly
50 55 60

Gln Ile Asp Leu Leu Glu Arg Leu Lys Glu Leu Asn Leu Asp Ser Ser
65 70 75 80

Asn Phe Pro Gly Val Lys Leu Arg Ser Lys Met Ser Leu Arg Ser Tyr
85 90 95

Gly Ser Arg Glu Gly Ser Val Ser Ser Arg Ser Gly Glu Cys Ser Pro
100 105 110

Val Pro Met Gly Ser Phe Pro Arg Arg Gly Phe Val Asn Gly Ser Arg
115 120 125

Glu Ser Thr Gly Tyr Leu Glu Glu Leu Glu Lys Glu Arg Ser Leu Leu
130 135 140

Leu Ala Asp Leu Asp Lys Glu Glu Lys Glu Lys Asp Trp Tyr Tyr Ala
145 150 155 160

Gln Leu Gln Asn Leu Thr Lys Arg Ile Asp Ser Leu Pro Leu Thr Glu
165 170 175

Asn Phe Ser Leu Gln Thr Asp Met Thr Arg Arg Gln Leu Glu Tyr Glu
180 185 190

Ala Arg Gln Ile Arg Val Ala Met Glu Glu Gln Leu Gly Thr Cys Gln
195 200 205

Asp Met Glu Lys Arg Ala Gln Arg Arg Ile Ala Arg Ile Gln Gln Ile
210 215 220

Glu Lys Asp Ile Leu Arg Ile Arg Gln Leu Leu Gln Ser Gln Ala Thr
225 230 235 240

Glu Ala Glu Arg Ser Ser Gln Asn Lys His Glu Thr Gly Ser His Asp
245 250 255

Ala Glu Arg Gln Asn Glu Gly Gln Gly Val Gly Glu Ile Asn Met Ala
260 265 270

Thr Ser Gly Asn Gly Gln Gly Ser Thr Thr Arg Met Asp His Glu Thr
275 280 285

Ala Ser Val Leu Ser Ser Ser Ser Thr His Ser Ala Pro Arg Arg Leu
290 295 300

Thr Ser His Leu Gly Thr Lys Val Glu Met Val Tyr Ser Leu Leu Ser
305 310 315 320

Met Leu Gly Thr His Asp Lys Asp Asp Met Ser Arg Thr Leu Leu Ala
325 330 335

Met Ser Ser Ser Gln Asp Ser Cys Ile Ser Met Arg Gln Ser Gly Cys
340 345 350

Leu Pro Leu Leu Ile Gln Leu Leu His Gly Asn Asp Lys Asp Ser Val
355 360 365

Leu Leu Gly Asn Ser Arg Gly Ser Lys Glu Ala Arg Ala Arg Ala Ser
370 375 380

Ala Ala Leu His Asn Ile Ile His Ser Gln Pro Asp Asp Lys Arg Gly
385 390 395 400

Arg Arg Glu Ile Arg Val Leu His Leu Leu Glu Gln Ile Arg Ala Tyr
405 410 415

Cys Ser Thr Cys Trp Glu Trp Gln Glu Ala His Glu Pro Gly Met Asp
420 425 430

Gln Asp Lys Asn Pro Met Pro Ala Pro Val Glu His Gln Ile Cys Pro
435 440 445

Ala Val Cys Val Leu Met Lys Leu Ser Phe Asp Glu Glu His Arg His
450 455 460

Ala Met Asn Glu Leu Gly Gly Leu Gln Ala Ile Ala Glu Leu Leu Gln
465 470 475 480

Val Asp Cys Glu Met Tyr Gly Leu Thr Asn Asp His Tyr Ser Ile Thr
485 490 495

Leu Arg Arg Tyr Ala Gly Met Ala Leu Thr Asn Leu Thr Phe Gly Asp
500 505 510

Val Ala Asn Lys Ala Thr Leu Cys Ser Met Lys Gly Cys Met Arg Ala
515 520 525

Leu Val Ala Gln Leu Lys Ser Glu Ser Glu Asp Leu Gln Gln Val Ile
530 535 540

Ala Ser Val Leu Arg Asn Leu Ser Trp Arg Ala Asp Val Asn Ser Lys
545 550 555 560

Lys Thr Leu Arg Glu Val Gly Ser Val Lys Ala Leu Met Glu Cys Ala
565 570 575

Leu Glu Val Lys Lys Glu Ser Thr Leu Lys Ser Val Leu Ser Ala Leu
580 585 590

Trp Asn Leu Ser Ala His Cys Thr Glu Asn Lys Ala Asp Ile Cys Ala
595 600 605

Val Asp Gly Ala Leu Ala Phe Leu Val Gly Thr Leu Thr Tyr Arg Ser
610 615 620

Gln Thr Asn Thr Leu Ala Ile Ile Glu Ser Gly Gly Gly Ile Leu Arg
625 630 635 640

Asn Val Ser Ser Leu Ile Ala Thr Asn Glu Asp His Arg Gln Ile Leu
645 650 655

Arg Glu Asn Asn Cys Leu Gln Thr Leu Leu Gln His Leu Lys Ser His
660 665 670

Ser Leu Thr Ile Val Ser Asn Ala Cys Gly Thr Leu Trp Asn Leu Ser
675 680 685

Ala Arg Asn Pro Lys Asp Gln Glu Ala Leu Trp Asp Met Gly Ala Val
690 695 700

Ser Met Leu Lys Asn Leu Ile His Ser Lys His Lys Met Ile Ala Met
705 710 715 720

Gly Ser Ala Ala Ala Leu Arg Asn Leu Met Ala Asn Arg Pro Ala Lys
725 730 735

Tyr Lys Asp Ala Asn Ile Met Ser Pro Gly Ser Ser Leu Pro Ser Leu
740 745 750

His Val Arg Lys Gln Lys Ala Leu Glu Ala Glu Leu Asp Ala Gln His
755 760 765

Leu Ser Glu Thr Phe Asp Asn Ile Asp Asn Ile Ser Pro Lys Ala Ser
770 775 780

His Arg Ser Lys Gln Arg His Lys Gln Ser Leu Tyr Gly Asp Tyr Val
785 790 795 800

Phe Asp Thr Asn Arg His Asp Asp Asn Arg Ser Asp Asn Phe Asn Thr
805 810 815

Gly Asn Met Thr Val Leu Ser Pro Tyr Leu Asn Thr Thr Val Leu Pro
820 825 830

Ser Ser Ser Ser Ser Arg Gly Ser Leu Asp Ser Ser Arg Ser Glu Lys
835 840 845

Asp Arg Ser Leu Glu Arg Glu Arg Gly Ile Gly Leu Gly Asn Tyr His
850 855 860

Pro Ala Thr Glu Asn Pro Gly Thr Ser Ser Lys Arg Gly Leu Gln Ile
865 870 875 880

Ser Thr Thr Ala Ala Gln Ile Ala Lys Val Met Glu Glu Val Ser Ala
885 890 895

Ile His Thr Ser Gln Glu Asp Arg Ser Ser Gly Ser Thr Thr Glu Leu
900 905 910

His Cys Val Thr Asp Glu Arg Asn Ala Leu Arg Arg Ser Ser Ala Ala
915 920 925

His Thr His Ser Asn Thr Tyr Asn Phe Thr Lys Ser Glu Asn Ser Asn
930 935 940

Arg Thr Cys Ser Met Pro Tyr Ala Lys Leu Glu Tyr Lys Arg Ser Ser
945 950 955 960

Asn Asp Ser Leu Asn Ser Val Ser Ser Ser Asp Gly Tyr Gly Lys Arg
965 970 975

Gly Gln Met Lys Pro Ser Ile Glu Ser Tyr Ser Glu Asp Asp Glu Ser
980 985 990

Lys Phe Cys Ser Tyr Gly Gln Tyr Pro Ala Asp Leu Ala His Lys Ile
995 1000 1005

His Ser Ala Asn His Met Asp Asp Asn Asp Gly Glu Leu Asp Thr Pro
1010 1015 1020

Ile Asn Tyr Ser Leu Lys Tyr Ser Asp Glu Gln Leu Asn Ser Gly Arg
1025 1030 1035 1040

Gln Ser Pro Ser Gln Asn Glu Arg Trp Ala Arg Pro Lys His Ile Ile
1045 1050 1055

Glu Asp Glu Ile Lys Gln Ser Glu Gln Arg Gln Ser Arg Asn Gln Ser
1060 1065 1070

Thr Thr Tyr Pro Val Tyr Thr Glu Ser Thr Asp Asp Lys His Leu Lys
1075 1080 1085

Phe Gln Pro His Phe Gly Gln Gln Glu Cys Val Ser Pro Tyr Arg Ser
1090 1095 1100

Arg Gly Ala Asn Gly Ser Glu Thr Asn Arg Val Gly Ser Asn His Gly
1105 1110 1115 1120

Ile Asn Gln Asn Val Ser Gln Ser Leu Cys Gln Glu Asp Asp Tyr Glu
1125 1130 1135

Asp Asp Lys Pro Thr Asn Tyr Ser Glu Arg Tyr Ser Glu Glu Glu Gln
1140 1145 1150

His Glu Glu Glu Glu Arg Pro Thr Asn Tyr Ser Ile Lys Tyr Asn Glu
1155 1160 1165

Glu Lys Arg His Val Asp Gln Pro Ile Asp Tyr Ser Ile Leu Lys Ala
1170 1175 1180

Thr Asp Ile Pro Ser Ser Gln Lys Gln Ser Phe Ser Phe Ser Lys Ser
1185 1190 1195 1200

Ser Ser Gly Gln Ser Ser Lys Thr Glu His Met Ser Ser Ser Ser Glu
1205 1210 1215

Asn Thr Ser Thr Pro Ser Ser Asn Ala Lys Arg Gln Asn Gln Leu His
1220 1225 1230

Pro Ser Ser Ala Gln Ser Arg Ser Gly Gln Pro Gln Lys Ala Ala Thr
1235 1240 1245

Cys Lys Val Ser Ser Ile Asn Gln Glu Thr Ile Gln Thr Tyr Cys Val
1250 1255 1260

Glu Asp Thr Pro Ile Cys Phe Ser Arg Cys Ser Ser Leu Ser Ser Leu
1265 1270 1275 1280

Ser Ser Ala Glu Asp Glu Ile Gly Cys Asn Gln Thr Thr Gln Glu Ala
1285 1290 1295

Asp Ser Ala Asn Thr Leu Gln Ile Ala Glu Ile Lys Glu Lys Ile Gly
1300 1305 1310

Thr Arg Ser Ala Glu Asp Pro Val Ser Glu Val Pro Ala Val Ser Gln
1315 1320 1325

His Pro Arg Thr Lys Ser Ser Arg Leu Gln Gly Ser Ser Leu Ser Ser
1330 1335 1340

Glu Ser Ala Arg His Lys Ala Val Glu Phe Ser Ser Gly Ala Lys Ser
1345 1350 1355 1360

Pro Ser Lys Ser Gly Ala Gln Thr Pro Lys Ser Pro Pro Glu His Tyr
1365 1370 1375

Val Gln Glu Thr Pro Leu Met Phe Ser Arg Cys Thr Ser Val Ser Ser
1380 1385 1390

Leu Asp Ser Phe Glu Ser Arg Ser Ile Ala Ser Ser Val Gln Ser Glu
1395 1400 1405

Pro Cys Ser Gly Met Val Ser Gly Ile Ile Ser Pro Ser Asp Leu Pro
1410 1415 1420

Asp Ser Pro Gly Gln Thr Met Pro Pro Ser Arg Ser Lys Thr Pro Pro
1425 1430 1435 1440

Pro Pro Pro Gln Thr Ala Gln Thr Lys Arg Glu Val Pro Lys Asn Lys
1445 1450 1455

Ala Pro Thr Ala Glu Lys Arg Glu Ser Gly Pro Lys Gln Ala Ala Val
1460 1465 1470

Asn Ala Ala Val Gln Arg Val Gln Val Leu Pro Asp Ala Asp Thr Leu
1475 1480 1485

Leu His Phe Ala Thr Glu Ser Thr Pro Asp Gly Phe Ser Cys Ser Ser
1490 1495 1500

Ser Leu Ser Ala Leu Ser Leu Asp Glu Pro Phe Ile Gln Lys Asp Val
1505 1510 1515 1520

Glu Leu Arg Ile Met Pro Pro Val Gln Glu Asn Asp Asn Gly Asn Glu
1525 1530 1535

Thr Glu Ser Glu Gln Pro Lys Glu Ser Asn Glu Asn Gln Glu Lys Glu
1540 1545 1550

Ala Glu Lys Thr Ile Asp Ser Glu Lys Asp Leu Leu Asp Asp Ser Asp
1555 1560 1565

Asp Asp Asp Ile Glu Ile Leu Glu Glu Cys Ile Ile Ser Ala Met Pro
1570 1575 1580

Thr Lys Ser Ser Arg Lys Ala Lys Lys Pro Ala Gln Thr Ala Ser Lys
1585 1590 1595 1600

Leu Pro Pro Pro Val Ala Arg Lys Pro Ser Gln Leu Pro Val Tyr Lys
1605 1610 1615

Leu Leu Pro Ser Gln Asn Arg Leu Gln Pro Gln Lys His Val Ser Phe
1620 1625 1630

Thr Pro Gly Asp Asp Met Pro Arg Val Tyr Cys Val Glu Gly Thr Pro
1635 1640 1645

Ile Asn Phe Ser Thr Ala Thr Ser Leu Ser Asp Leu Thr Ile Glu Ser
1650 1655 1660

Pro Pro Asn Glu Leu Ala Ala Gly Glu Gly Val Arg Gly Gly Ala Gln
1665 1670 1675 1680

Ser Gly Glu Phe Glu Lys Arg Asp Thr Ile Pro Thr Glu Gly Arg Ser
1685 1690 1695

Thr Asp Glu Ala Gln Gly Gly Lys Thr Ser Ser Val Thr Ile Pro Glu
1700 1705 1710

Leu Asp Asp Asn Lys Ala Glu Glu Gly Asp Ile Leu Ala Glu Cys Ile
1715 1720 1725

Asn Ser Ala Met Pro Lys Gly Lys Ser His Lys Pro Phe Arg Val Lys
1730 1735 1740

Lys Ile Met Asp Gln Val Gln Gln Ala Ser Ala Ser Ser Ser Ala Pro
1745 1750 1755 1760

Asn Lys Asn Gln Leu Asp Gly Lys Lys Lys Lys Pro Thr Ser Pro Val
1765 1770 1775

Lys Pro Ile Pro Gln Asn Thr Glu Tyr Arg Thr Arg Val Arg Lys Asn
1780 1785 1790

Ala Asp Ser Lys Asn Asn Leu Asn Ala Glu Arg Val Phe Ser Asp Asn
1795 1800 1805

Lys Asp Ser Lys Lys Gln Asn Leu Lys Asn Asn Ser Lys Asp Phe Asn
1810 1815 1820

Asp Lys Leu Pro Asn Asn Glu Asp Arg Val Arg Gly Ser Phe Ala Phe
1825 1830 1835 1840

Asp Ser Pro His His Tyr Thr Pro Ile Glu Gly Thr Pro Tyr Cys Phe
1845 1850 1855

Ser Arg Asn Asp Ser Leu Ser Ser Leu Asp Phe Asp Asp Asp Asp Val
1860 1865 1870

Asp Leu Ser Arg Glu Lys Ala Glu Leu Arg Lys Ala Lys Glu Asn Lys
1875 1880 1885

Glu Ser Glu Ala Lys Val Thr Ser His Thr Glu Leu Thr Ser Asn Gln
1890 1895 1900

Gln Ser Ala Asn Lys Thr Gln Ala Ile Ala Lys Gln Pro Ile Asn Arg
1905 1910 1915 1920

Gly Gln Pro Lys Pro Ile Leu Gln Lys Gln Ser Thr Phe Pro Gln Ser
1925 1930 1935

Ser Lys Asp Ile Pro Asp Arg Gly Ala Ala Thr Asp Glu Lys Leu Gln
1940 1945 1950

Asn Phe Ala Ile Glu Asn Thr Pro Val Cys Phe Ser His Asn Ser Ser
1955 1960 1965

Leu Ser Ser Leu Ser Asp Ile Asp Gln Glu Asn Asn Asn Lys Glu Asn
1970 1975 1980

Glu Pro Ile Lys Glu Thr Glu Pro Pro Asp Ser Gln Gly Glu Pro Ser
1985 1990 1995 2000

Lys Pro Gln Ala Ser Gly Tyr Ala Pro Lys Ser Phe His Val Glu Asp
2005 2010 2015

Thr Pro Val Cys Phe Ser Arg Asn Ser Ser Leu Ser Ser Leu Ser Ile
2020 2025 2030

Asp Ser Glu Asp Asp Leu Leu Gln Glu Cys Ile Ser Ser Ala Met Pro
2035 2040 2045

Lys Lys Lys Lys Pro Ser Arg Leu Lys Gly Asp Asn Glu Lys His Ser
2050 2055 2060

Pro Arg Asn Met Gly Gly Ile Leu Gly Glu Asp Leu Thr Leu Asp Leu
2065 2070 2075 2080

Lys Asp Ile Gln Arg Pro Asp Ser Glu His Gly Leu Ser Pro Asp Ser
2085 2090 2095

Glu Asn Phe Asp Trp Lys Ala Ile Gln Glu Gly Ala Asn Ser Ile Val
2100 2105 2110

Ser Ser Leu His Gln Ala Ala Ala Ala Ala Cys Leu Ser Arg Gln Ala
2115 2120 2125

Ser Ser Asp Ser Asp Ser Ile Leu Ser Leu Lys Ser Gly Ile Ser Leu
2130 2135 2140

Gly Ser Pro Phe His Leu Thr Pro Asp Gln Glu Glu Lys Pro Phe Thr
2145 2150 2155 2160

Ser Asn Lys Gly Pro Arg Ile Leu Lys Pro Gly Glu Lys Ser Thr Leu
2165 2170 2175

Glu Thr Lys Lys Ile Glu Ser Glu Ser Lys Gly Ile Lys Gly Gly Lys
2180 2185 2190

Lys Val Tyr Lys Ser Leu Ile Thr Gly Lys Val Arg Ser Asn Ser Glu
2195 2200 2205

Ile Ser Gly Gln Met Lys Gln Pro Leu Gln Ala Asn Met Pro Ser Ile
2210 2215 2220

Ser Arg Gly Arg Thr Met Ile His Ile Pro Gly Val Arg Asn Ser Ser
2225 2230 2235 2240

Ser Ser Thr Ser Pro Val Ser Lys Lys Gly Pro Pro Leu Lys Thr Pro
2245 2250 2255

Ala Ser Lys Ser Pro Ser Glu Gly Gln Thr Ala Thr Thr Ser Pro Arg
2260 2265 2270

Gly Ala Lys Pro Ser Val Lys Ser Glu Leu Ser Pro Val Ala Arg Gln
2275 2280 2285

Thr Ser Gln Ile Gly Gly Ser Ser Lys Ala Pro Ser Arg Ser Gly Ser
2290 2295 2300

Arg Asp Ser Thr Pro Ser Arg Pro Ala Gln Gln Pro Leu Ser Arg Pro
2305 2310 2315 2320

Ile Gln Ser Pro Gly Arg Asn Ser Ile Ser Pro Gly Arg Asn Gly Ile
2325 2330 2335

Ser Pro Pro Asn Lys Ile Ser Gln Leu Pro Arg Thr Ser Ser Pro Ser
2340 2345 2350

Thr Ala Ser Thr Lys Ser Ser Gly Ser Gly Lys Met Ser Tyr Thr Ser
2355 2360 2365

Pro Gly Arg Gln Met Ser Gln Gln Asn Leu Thr Lys Gln Thr Gly Leu
2370 2375 2380

Ser Lys Asn Ala Ser Ser Ile Pro Arg Ser Glu Ser Ala Ser Lys Gly
2385 2390 2395 2400

Leu Asn Gln Met Asn Asn Gly Asn Gly Ala Asn Lys Lys Val Glu Leu
2405 2410 2415

Ser Arg Met Ser Ser Thr Lys Ser Ser Gly Ser Glu Ser Asp Arg Ser
2420 2425 2430

Glu Arg Pro Val Leu Val Arg Gln Ser Thr Phe Ile Lys Glu Ala Pro
2435 2440 2445

Ser Pro Thr Leu Arg Arg Lys Leu Glu Glu Ser Ala Ser Phe Glu Ser
2450 2455 2460

Leu Ser Pro Ser Ser Arg Pro Ala Ser Pro Thr Arg Ser Gln Ala Gln
2465 2470 2475 2480

Thr Pro Val Leu Ser Pro Ser Leu Pro Asp Met Ser Leu Ser Thr His
2485 2490 2495

Ser Ser Val Gln Ala Gly Gly Trp Arg Lys Leu Pro Pro Asn Leu Ser
2500 2505 2510

Pro Thr lie Glu Tyr Asn Asp Gly Arg Pro Ala Lys Arg His Asp lie 
5 2515 2520 2525 

Ala Arg Ser His Ser Glu Ser Pro Ser Arg Leu Pro lie Asn Arg Ser 
2530 2535 2540 

10 Gly Thr Trp Lys Arg Glu His Ser Lys His Ser Ser Ser Leu Pro Arg 

2545 2550 2555 2560 



15 



30 



45 



60 



Val Ser Thr Trp Arg Arg Thr Gly Ser Ser Ser Ser lie Leu Ser Ala 
2565 2570 2575 

Ser Ser Glu Ser Ser Glu Lys Ala Lys Ser Glu Asp Glu Lys His Val 
2580 2585 2590 



Asn Ser Ile Ser Gly Thr Lys Gln Ser Lys Glu Asn Gln Val Ser Ala
2595 2600 2605

Lys Gly Thr Trp Arg Lys Ile Lys Glu Asn Glu Phe Ser Pro Thr Asn
2610 2615 2620

Ser Thr Ser Gln Thr Val Ser Ser Gly Ala Thr Asn Gly Ala Glu Ser
2625 2630 2635 2640

Lys Thr Leu Ile Tyr Gln Met Ala Pro Ala Val Ser Lys Thr Glu Asp
2645 2650 2655

Val Trp Val Arg Ile Glu Asp Cys Pro Ile Asn Asn Pro Arg Ser Gly
2660 2665 2670

Arg Ser Pro Thr Gly Asn Thr Pro Pro Val Ile Asp Ser Val Ser Glu
2675 2680 2685

Lys Ala Asn Pro Asn Ile Lys Asp Ser Lys Asp Asn Gln Ala Lys Gln
2690 2695 2700

Asn Val Gly Asn Gly Ser Val Pro Met Arg Thr Val Gly Leu Glu Asn
2705 2710 2715 2720

Arg Leu Asn Ser Phe Ile Gln Val Asp Ala Pro Asp Gln Lys Gly Thr
2725 2730 2735

Glu Ile Lys Pro Gly Gln Asn Asn Pro Val Pro Val Ser Glu Thr Asn
2740 2745 2750

Glu Ser Ser Ile Val Glu Arg Thr Pro Phe Ser Ser Ser Ser Ser Ser
2755 2760 2765

Lys His Ser Ser Pro Ser Gly Thr Val Ala Ala Arg Val Thr Pro Phe
2770 2775 2780

Asn Tyr Asn Pro Ser Pro Arg Lys Ser Ser Ala Asp Ser Thr Ser Ala
2785 2790 2795 2800

Arg Pro Ser Gln Ile Pro Thr Pro Val Asn Asn Asn Thr Lys Lys Arg
2805 2810 2815

Asp Ser Lys Thr Asp Ser Thr Glu Ser Ser Gly Thr Gln Ser Pro Lys
2820 2825 2830

Arg His Ser Gly Ser Tyr Leu Val Thr Ser Val
2835 2840
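
The three-letter listing above can be collapsed to one-letter code, which makes it easier to compare against a one-letter rendering of the same residues. The following is a minimal sketch in Python, assuming the standard IUPAC three-letter codes; the helper name `three_to_one` is hypothetical and is not part of the original disclosure. Numeric position markers are skipped, and any unrecognized token is rejected so that scanning errors are not silently accepted.

```python
# Standard IUPAC three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def three_to_one(listing: str) -> str:
    """Convert whitespace-separated three-letter codes to one-letter code.

    Purely numeric tokens (position markers such as 2835) are skipped.
    Unknown tokens raise ValueError (hypothetical policy chosen for this
    sketch) so that damaged residue names are noticed.
    """
    out = []
    for token in listing.split():
        if token.isdigit():
            continue
        try:
            out.append(THREE_TO_ONE[token])
        except KeyError:
            raise ValueError(f"unrecognized residue code: {token!r}") from None
    return "".join(out)


# Final line of the listing above, with its position markers.
tail = three_to_one("Arg His Ser Gly Ser Tyr Leu Val Thr Ser Val 2835 2840")
print(tail)
```

Applied to the final line of the listing, the sketch yields an eleven-residue string ending in T-S-V.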

(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

CGGAATTCNN NNNNNNNAAC AGCNNNNNNN NNAATGAANN NCAAAGTCTG NNNTGAGGAT 60
CCTCA 65

(2) INFORMATION FOR SEQ ID NO: 32:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(iv) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:

CGGAATTCGA CTCAGAANNN NNNAACTTCA GANNNNNNAT CNNNNNNNNN GTCTGAGGAT 60
CCTCA 65

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 65 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid
(iv) ANTI-SENSE: NO
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

CGGAATTCNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNTGAGGAT 60
CCTCA 65
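
The three oligonucleotides above mix fixed bases with degenerate positions written as N. A short Python sketch can confirm the stated length of 65 bases and report where the degenerate runs fall. The function name `summarize` and the dictionary layout are hypothetical. The check for the substrings GAATTC and GGATCC reflects the editor's observation that these are the EcoRI and BamHI recognition sequences; the listing itself does not state their purpose.

```python
import re

# SEQ ID NO: 31 to 33 exactly as grouped in the listing above.
OLIGOS = {
    31: "CGGAATTCNN NNNNNNNAAC AGCNNNNNNN NNAATGAANN NCAAAGTCTG NNNTGAGGAT CCTCA",
    32: "CGGAATTCGA CTCAGAANNN NNNAACTTCA GANNNNNNAT CNNNNNNNNN GTCTGAGGAT CCTCA",
    33: "CGGAATTCNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNTGAGGAT CCTCA",
}


def summarize(listing: str) -> dict:
    """Report length and degenerate (N) runs for one grouped sequence."""
    seq = listing.replace(" ", "")
    # Lengths of each maximal run of consecutive N positions, in order.
    runs = [len(run) for run in re.findall(r"N+", seq)]
    return {
        "length": len(seq),
        "n_runs": runs,
        "n_total": sum(runs),
        "has_gaattc": "GAATTC" in seq,  # assumed EcoRI site
        "has_ggatcc": "GGATCC" in seq,  # assumed BamHI site
    }


for seq_id, listing in OLIGOS.items():
    print(seq_id, summarize(listing))
```

Every degenerate run in all three sequences has a length divisible by three, which is consistent with randomizing whole codons, although that interpretation is an inference rather than something stated in the listing.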
What is claimed is:



1. A composition capable of inhibiting specific binding
between a signal-transducing protein and a
cytoplasmic protein containing the amino acid
sequence (G/S/A/E)-L-G-(F/I/L), wherein each -
represents a peptide bond, each parenthesis encloses
amino acids which are alternatives to one another, and
each slash within such parentheses separates the
alternative amino acids.

2. The composition of claim 1, wherein the cytoplasmic
protein contains the amino acid sequence (K/R/Q)-Xn-
(G/S/A/E)-L-G-(F/I/L), wherein X represents any
amino acid which is selected from the group
comprising the twenty naturally occurring amino
acids and n represents at least 2, but not more than
4.

3. The composition of claim 1, wherein the cytoplasmic
protein contains the amino acid sequence SLGI.

4. The composition of claim 1, wherein the signal-
transducing protein has at its carboxyl terminus the
amino acid sequence (S/T)-X-(V/I/L), wherein each -
represents a peptide bond, each parenthesis encloses
amino acids which are alternatives to one another,
each slash within such parentheses separates the
alternative amino acids, and the X represents any
amino acid which is selected from the group
comprising the twenty naturally occurring amino
acids.

5. The composition of claim 1, wherein the composition
comprises an antibody, an inorganic compound, an
organic compound, a peptide, a peptidomimetic
compound, a polypeptide, or a protein.

6. The composition of claim 5, wherein the peptide
comprises the sequence (S/T)-X-(V/I/L)-COOH, wherein
each - represents a peptide bond, each parenthesis
encloses amino acids which are alternatives to one
another, each slash within such parentheses separates
the alternative amino acids, and the X represents any
amino acid which is selected from the group
comprising the twenty naturally occurring amino
acids.

7. The composition of claim 6, wherein the peptide has
the amino acid sequence DSENSNFRNEIQSLV.

8. The composition of claim 6, wherein the peptide has
the amino acid sequence RNEIQSLV.

9. The composition of claim 6, wherein the peptide has
the amino acid sequence NEIQSLV.

10. The composition of claim 6, wherein the peptide has
the amino acid sequence EIQSLV.

11. The composition of claim 6, wherein the peptide has
the amino acid sequence IQSLV.

12. The composition of claim 6, wherein the peptide has
the amino acid sequence QSLV.

13. The composition of claim 6, wherein the peptide has
the amino acid sequence SLV.

14. The composition of claim 6, wherein the peptide has
the amino acid sequence IPPDSEDGNEEQSLV.

15. The composition of claim 6, wherein the peptide has
the amino acid sequence DSEMYNFRSQLASVV.

16. The composition of claim 6, wherein the peptide has
the amino acid sequence IDLASEFLFLSNSFL.

17. The composition of claim 6, wherein the peptide has
the amino acid sequence PPTCSQANSGRISTL.

18. The composition of claim 6, wherein the peptide has
the amino acid sequence SDSNMNMNELSEV.

19. The composition of claim 6, wherein the peptide has
the amino acid sequence QNFRTYIVSFV.

20. The composition of claim 6, wherein the peptide has
the amino acid sequence RETIESTV.

21. The composition of claim 6, wherein the peptide has
the amino acid sequence RGFISSLV.

22. The composition of claim 6, wherein the peptide has
the amino acid sequence TIQSVI.

23. The composition of claim 6, wherein the peptide has
the amino acid sequence ESLV.

24. The composition of claim 6, wherein the organic
compound has the sequence Ac-SLV-COOH, wherein
Ac represents an acetyl group and each - represents a
peptide bond.

25. A composition capable of inhibiting specific binding
between a signal-transducing protein having at its
carboxyl terminus the amino acid sequence (S/T)-X-
(V/I/L), wherein each - represents a peptide bond,
each parenthesis encloses amino acids which are
alternatives to one another, each slash within such
parentheses separates the alternative amino acids, and
the X represents any amino acid which is selected
from the group comprising the twenty naturally
occurring amino acids, and a cytoplasmic protein.

26. The composition of claim 25, wherein the composition
comprises an antibody, an inorganic compound, an
organic compound, a peptide, a peptidomimetic
compound, a polypeptide or a protein.

27. A method of identifying a compound capable of
inhibiting specific binding between a signal-
transducing protein and a cytoplasmic protein
containing the amino acid sequence (G/S/A/E)-L-G-
(F/I/L), wherein each - represents a peptide bond,
each parenthesis encloses amino acids which are
alternatives to one another, and each slash within such
parentheses separates the alternative amino acids,
which comprises:

(a) contacting the cytoplasmic protein bound to
the signal-transducing protein with a
plurality of compounds under conditions
permitting binding between a known compound
previously shown to be able to displace the
signal-transducing protein bound to the
cytoplasmic protein and the bound cytoplasmic
protein to form a complex; and

(b) detecting the displaced signal-transducing
protein or the complex formed in step (a),
wherein the displacement indicates that the
compound is capable of inhibiting specific
binding between the signal-transducing protein
and the cytoplasmic protein.



28. The method of claim 27, wherein the inhibition of
specific binding between the signal-transducing
protein and the cytoplasmic protein affects the
transcription activity of a reporter gene.

29. The method of claim 28, wherein in step (b) the
displaced signal-transducing protein or the complex
is detected by comparing the transcription activity
of a reporter gene before and after the contacting
with the compound in step (a), wherein a change of the
activity indicates that the specific binding between
the signal-transducing protein and the cytoplasmic
protein is inhibited and the signal-transducing
protein is displaced.

30. The method of claim 27, wherein the cytoplasmic
protein is bound to a solid support.

31. The method of claim 27, wherein the compound is
bound to a solid support.

32. The method of claim 27, wherein the compound
comprises an antibody, an inorganic compound, an
organic compound, a peptide, a peptidomimetic
compound, a polypeptide or a protein.

33. The method of claim 27, wherein the contacting of
step (a) is in vitro.

34. The method of claim 27, wherein the contacting of
step (a) is in vivo.

35. The method of claim 34, wherein the contacting of
step (a) is in a yeast cell.

36. The method of claim 34, wherein the contacting of
step (a) is in a mammalian cell.

37. The method of claim 27, wherein the signal-
transducing protein is a cell surface receptor.

38. The method of claim 27, wherein the signal-
transducing protein is a signal transducer protein.

39. The method of claim 27, wherein the signal-
transducing protein is a tumor suppressor protein.

40. The method of claim 37, wherein the cell surface
protein is the Fas receptor.

41. The method of claim 40, wherein the Fas receptor is
expressed in cells derived from organs comprising
the thymus, liver, kidney, colon, ovary, breast,
testis, spleen, stomach, prostate, uterus, skin,
head and neck.

42. The method of claim 40, wherein the Fas receptor is
expressed in cells comprising T-cells and B-cells.

43. The method of claim 37, wherein the cell-surface
receptor is the CD4 receptor.

44. The method of claim 37, wherein the cell-surface
receptor is the p75 receptor.

45. The method of claim 37, wherein the cell-surface
receptor is the serotonin 2A receptor.

46. The method of claim 37, wherein the cell-surface
receptor is the serotonin 2B receptor.

47. The method of claim 38, wherein the signal
transducer protein is Protein Kinase-C-α-type.

48. The method of claim 39, wherein the tumor suppressor
protein is adenomatosis polyposis coli tumor
suppressor protein.

49. The method of claim 39, wherein the tumor suppressor
protein is the colorectal mutant cancer protein.

50. The method of claim 27, wherein the cytoplasmic
protein contains the amino acid sequence SLGI,
wherein each - represents a peptide bond, each
parenthesis encloses amino acids which are
alternatives to one another, and each slash within
such parentheses separates the alternative amino
acids.

51. The method of claim 40, wherein the cytoplasmic
protein is Fas-associated phosphatase-1.

52. A method of identifying a compound capable of
inhibiting specific binding between a signal-
transducing protein having at its carboxyl terminus
the amino acid sequence (S/T)-X-(V/I/L), wherein
each - represents a peptide bond, each parenthesis
encloses amino acids which are alternatives to one
another, each slash within such parentheses separates
the alternative amino acids, and the X represents any
amino acid which is selected from the group
comprising the twenty naturally occurring amino
acids, and a cytoplasmic protein, which comprises:

(a) contacting the signal-transducing protein
bound to the cytoplasmic protein with a
plurality of compounds under conditions
permitting binding between a known compound
previously shown to be able to displace the
cytoplasmic protein bound to the signal-
transducing protein and the bound signal-
transducing protein to form a complex; and

(b) detecting the displaced cytoplasmic protein or
the complex of step (a), wherein the
displacement indicates that the compound is
capable of inhibiting specific binding between
the signal-transducing protein and the
cytoplasmic protein.

53. The method of claim 52, wherein the inhibition of
specific binding between the signal-transducing
protein and the cytoplasmic protein affects the
transcription activity of a reporter gene.

54. The method of claim 53, wherein in step (b) the
displaced cytoplasmic protein or the complex is
detected by comparing the transcription activity of
a reporter gene before and after the contacting with
the compound in step (a), wherein a change of the
activity indicates that the specific binding between
the signal-transducing protein and the cytoplasmic
protein is inhibited and the cytoplasmic protein is
displaced.

55. The method of claim 52, wherein the cytoplasmic
protein is bound to a solid support.

56. The method of claim 52, wherein the compound is
bound to a solid support.

57. The method of claim 52, wherein the compound
comprises an antibody, an inorganic compound, an
organic compound, a peptide, a peptidomimetic
compound, a polypeptide or a protein.

58. The method of claim 52, wherein the contacting of
step (a) is in vitro.

59. The method of claim 52, wherein the contacting of
step (a) is in vivo.

60. The method of claim 59, wherein the contacting of
step (a) is in a yeast cell.

61. The method of claim 59, wherein the contacting of
step (a) is in a mammalian cell.

62. The method of claim 52, wherein the signal-
transducing protein is a cell surface receptor.

63. The method of claim 52, wherein the signal-
transducing protein is a signal transducer protein.

64. The method of claim 52, wherein the signal-
transducing protein is a tumor suppressor protein.

65. The method of claim 62, wherein the cell surface
protein is the Fas receptor.

66. The method of claim 65, wherein the Fas receptor is
expressed in cells derived from organs comprising
the thymus, liver, kidney, colon, ovary, breast,
testis, spleen, stomach, prostate, uterus, skin,
head and neck.

67. The method of claim 65, wherein the Fas receptor is
expressed in cells comprising T-cells and B-cells.

68. The method of claim 62, wherein the cell-surface
receptor is the CD4 receptor.

69. The method of claim 62, wherein the cell-surface
receptor is the p75 receptor.

70. The method of claim 62, wherein the cell-surface
receptor is the serotonin 2A receptor.

71. The method of claim 62, wherein the cell-surface
receptor is the serotonin 2B receptor.

72. The method of claim 63, wherein the signal
transducer protein is Protein Kinase-C-α-type.

73. The method of claim 64, wherein the tumor suppressor
protein is adenomatosis polyposis coli tumor
suppressor protein.

74. The method of claim 64, wherein the tumor suppressor
protein is the colorectal mutant cancer protein.

75. The method of claim 52, wherein the cytoplasmic
protein contains the amino acid sequence SLGI,
wherein each - represents a peptide bond, each
parenthesis encloses amino acids which are
alternatives to one another, and each slash within
such parentheses separates the alternative amino
acids.

76. The method of claim 52, wherein the cytoplasmic
protein is Fas-associated phosphatase-1.

77. A method of inhibiting the proliferation of cancer
cells comprising the composition of claim 1.

78. The method of claim 77, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

79. The method of claim 77, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.



80. A method of inhibiting the proliferation of cancer
cells comprising the composition of claim 25.

81. The method of claim 80, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

82. The method of claim 80, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

83. A method of inhibiting the proliferation of cancer
cells comprising the compound identified by the
method of claim 27.

84. The method of claim 83, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

85. The method of claim 83, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

86. A method of inhibiting the proliferation of cancer
cells comprising the compound identified by the
method of claim 52.

87. The method of claim 86, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

88. The method of claim 86, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

89. A method of treating cancer in a subject which
comprises introducing to the subject's cancerous
cells an amount of the composition of claim 1
effective to result in apoptosis of the cells.

90. The method of claim 89, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

91. The method of claim 89, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

92. A method of treating cancer in a subject which
comprises introducing to the subject's cancerous
cells an amount of the composition of claim 25
effective to result in apoptosis of the cells.

93. The method of claim 92, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

94. The method of claim 92, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

95. A method of treating cancer in a subject which
comprises introducing to the subject's cancerous
cells an amount of the compound identified by the
method of claim 27 effective to allow apoptosis of
the cells.

96. The method of claim 95, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

97. The method of claim 95, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

98. A method of treating cancer in a subject which
comprises introducing to the subject's cancerous
cells an amount of the compound identified by the
method of claim 52 effective to result in apoptosis
of the cells.

99. The method of claim 98, wherein the cancer cells are
derived from organs comprising the thymus, liver,
kidney, colon, ovary, breast, testis, spleen,
stomach, prostate, uterus, skin, head and neck.

100. The method of claim 98, wherein the cancer cells are
derived from cells comprising T-cells and B-cells.

101. A method of inhibiting the proliferation of virally
infected cells comprising the composition of claim
1.

102. A method of inhibiting the proliferation of virally
infected cells comprising the composition of claim
25.

103. A method of inhibiting the proliferation of virally
infected cells comprising the compound identified by
the method of claim 27.

104. A method of inhibiting the proliferation of virally
infected cells comprising the compound identified by
the method of claim 52.

105. The method of claim 101, wherein the virally
infected cells comprise Hepatitis B virus, Epstein-
Barr virus, influenza virus, Papilloma virus, Adeno
virus, Human T-cell lymphotropic virus, type 1 or
HIV.

106. The method of claim 102, wherein the virally
infected cells comprise Hepatitis B virus, Epstein-
Barr virus, influenza virus, Papilloma virus, Adeno
virus, Human T-cell lymphotropic virus, type 1 or
HIV.

107. The method of claim 103, wherein the virally
infected cells comprise Hepatitis B virus, Epstein-
Barr virus, influenza virus, Papilloma virus, Adeno
virus, Human T-cell lymphotropic virus, type 1 or
HIV.

108. The method of claim 104, wherein the virally
infected cells comprise Hepatitis B virus, Epstein-
Barr virus, influenza virus, Papilloma virus, Adeno
virus, Human T-cell lymphotropic virus, type 1 or
HIV.



109. A method of treating a virally-infected subject
which comprises introducing to the subject's
virally-infected cells the composition of claim 1
effective to result in apoptosis of the cells.

110. A method of treating a virally-infected subject
which comprises introducing to the subject's
virally-infected cells the composition of claim 25
effective to result in apoptosis of the cells.

111. A method of treating a virally-infected subject
which comprises introducing to the subject's
virally-infected cells an amount of the compound
identified by the method of claim 27 effective to
result in apoptosis of the cells.

112. A method of treating a virally-infected subject
which comprises introducing to the subject's
virally-infected cells an amount of the compound
identified by the method of claim 52 effective to
result in apoptosis of the cells.



113. The method of claim 109, wherein the virally
infected cells comprise the Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma
virus, Adeno virus, Human T-cell lymphotropic virus,
type 1 or HIV.

114. The method of claim 110, wherein the virally
infected cells comprise the Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma
virus, Adeno virus, Human T-cell lymphotropic virus,
type 1 or HIV.

115. The method of claim 111, wherein the virally
infected cells comprise the Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma
virus, Adeno virus, Human T-cell lymphotropic virus,
type 1 or HIV.

116. The method of claim 112, wherein the virally
infected cells comprise the Hepatitis B virus,
Epstein-Barr virus, influenza virus, Papilloma
virus, Adeno virus, Human T-cell lymphotropic virus,
type 1 or HIV.



117. A pharmaceutical composition comprising the
composition of claim 1 in an effective amount and a
pharmaceutically acceptable carrier.

118. A pharmaceutical composition comprising the
composition of claim 25 in an effective amount and
a pharmaceutically acceptable carrier.

119. A pharmaceutical composition comprising the compound
identified by the method of claim 27 in an effective
amount and a pharmaceutically acceptable carrier.

120. A pharmaceutical composition comprising the compound
identified by the method of claim 52 in an effective
amount and a pharmaceutically acceptable carrier.
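
The bracketed motif notation used throughout the claims maps directly onto regular-expression character classes. The following Python sketch is illustrative only and is not part of the claimed subject matter; the pattern names, the helper `has_c_terminal_motif`, and the test strings `KAASLGI` and `KASLGI` are hypothetical. It assumes one-letter codes for the twenty naturally occurring amino acids.

```python
import re

# The twenty naturally occurring amino acids in one-letter code ("X").
ANY = "[ACDEFGHIKLMNPQRSTVWY]"

# (G/S/A/E)-L-G-(F/I/L): the four-residue motif recited in claim 1.
GLGF = re.compile(r"[GSAE]LG[FIL]")

# (K/R/Q)-Xn-(G/S/A/E)-L-G-(F/I/L) with n from 2 to 4, as in claim 2.
GLGF_EXTENDED = re.compile(r"[KRQ]" + ANY + r"{2,4}[GSAE]LG[FIL]")

# (S/T)-X-(V/I/L) anchored at the carboxyl terminus, as in claims 4 and 6.
C_TERMINAL = re.compile(r"[ST]" + ANY + r"[VIL]$")


def has_c_terminal_motif(peptide: str) -> bool:
    """Return True if the last three residues fit (S/T)-X-(V/I/L)."""
    return C_TERMINAL.search(peptide.upper()) is not None


# Peptides recited in claims 7, 13, 19 and 22.
for peptide in ("DSENSNFRNEIQSLV", "SLV", "QNFRTYIVSFV", "TIQSVI"):
    print(peptide, has_c_terminal_motif(peptide))

# SLGI (claim 3) is one instance of the claim 1 motif.
print(GLGF.fullmatch("SLGI") is not None)
```

Anchoring the third pattern with `$` reflects the requirement that the motif sit at the carboxyl terminus; a peptide that merely contains S-L-V internally would not match.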

FIG. 7H

1 maaasydqll kqvealkmen snlrqeledn snhltklete asnmkevlkq lqgsiedeam
61 assgqidlle rlkelnldss nfpgvklrsk mslrsygsre gsvssrsgec spvpmgsfpr 
121 rgfvngsres tgyleeleke rsllladldk eekekdwyya qlqnltkrid slpltenfsl 
181 qtdmtrrqle yearqirvam eeqlgtcqdm ekraqrriar iqqiekdilr irqllqsqat 
241 eaerssqnkh etgshdaerq negqgvgein matsgngqgs ttrmdhetas vlssssthsa 
301 prrltshlgt kvemvyslls mlgthdkddm srtllamsss qdscismrqs gclplliqll 
361 hgndkdsvll gnsrgskear arasaalhni ihsqpddkrg rreirvlhll eqiraycetc 
421 wewqeahepg mdqdknpmpa pvehqicpav cvlmklsfde ehrhamnelg glqaiaellq 
481 vdcemygltn dhysitlrry agmaltnltf gdvankatlc smkgcmralv aqlksesedl 
541 qqviasvlrn lswradvnsk ktlrevgsvk almecalevk kestlksvls alwnlsahct
601 enkadicavd galaflvgtl tyrsqtntla iiesgggilr nvssliatne dhrqilrenn 
661 clqtllqhlk shsltivsna cgtlwnlsar npkdqealwd mgavsmlknl ihskhkmiam 
721 gsaaalrnlm anrpakykda nimspgsslp slhvrkqkal eaeldaqhls etfdnidnls 
781 pkashrskqr hkqslygdyv fdtnrhddnr sdnfntgnmt vlspylnttv lpsssssrgs
841 ldssrsekdr slerergigl gnyhpatenp gtsskrglqi sttaaqiakv meevsaihts
901 qedrssgstt elhcvtdern alrrssaaht hsntynftks ensnrtcsmp yakleykrss 
961 ndslnsvsss dgygkrgqmk psiesysedd eskfcsygqy padlahkihs anhmddndge 
1021 ldtpinyslk ysdeqlnsgr qspsqnerwa rpkhiiedei kqseqrqsrn qsttypvyte
1081 stddkhlkfq phfgqqecvs pyrsrgangs etnrvgsnhg inqnvsqslc qeddyeddkp 
1141 tnyserysee eqheeeerpt nysikyneek rhvdqpidys lkyatdipss qkqsfsfsks
1201 ssgqsskteh mssssentst pssnakrqnq lhpssaqsrs gqpqkaatck vssinqetiq
1261 tycvedtpic fsrcsslssl ssaedeigcn qttqeadsan tlqiaeikek igtrsaedpv 
1321 sevpavsqhp rtkssrlqgs slssesarhk avefssgaks psksgaqtpk sppehyvqet 
1381 plmfsrctsv ssldsfesrs iassvqsepc sgmvsgiisp sdlpdspgqt mppsrsktpp 
1441 pppqtaqtkr evpknkapta ekresgpkqa avnaavqrvq vlpdadtllh fatestpdgf 
1501 scssslsals ldepfiqkdv elrimppvqe ndngnetese qpkesnenqe keaektidse
1561 kdllddsddd dieileecii samptkssrk akkpaqtask lpppvarkps qlpvykllps
1621 qnrlqpqkhv sftpgddmpr vycvegtpin fstatslsdl tiesppnela agegvrggaq 
1681 sgefekrdti ptegrstdea qggktssvti pelddnkaee gdilaecins ampkgkshkp 
1741 frvkkimdqv qqasasssap nknqldgkkk kptspvkpip qnteyrtrvr knadsknnln 
1801 aervfsdnkd skkqnlknns kdfndklpnn edrvrgsfaf dsphhytpie gtpycfsrnd 
1861 slssldfddd dvdlsrekae lrkakenkes eakvtshtel tsnqqsankt qaiakqpinr
1921 gqpkpilqkq stfpqsskdi pdrgaatdek lqnfaientp vcfshnssls slsdidqenn
1981 nkenepiket eppdsqgeps kpqasgyapk sfhvedtpvc fsrnsslssl sidseddllq 
2041 ecissampkk kkpsrlkgdn ekhsprnmgg ilgedltldl kdiqrpdseh glspdsenfd 
2101 wkaiqegans ivsslhqaaa aaclsrqass dsdsilslks gislgspfhl tpdqeekpft 
2161 snkgprilkp gekstletkk ieseskgikg gkkvykslit gkvrsnseis gqmkqplqan 
2221 mpsisrgrtm ihipgvrnss sstspvskkg pplktpasks psegqtatts prgakpsvks
2281 elspvarqts qiggsskaps rsgsrdstps rpaqqplsrp iqspgrnsis pgrngisppn
2341 klsqlprtss pstastkssg sgkmsytspg rqmsqqnltk qtglsknass iprsesaskg
2401 lnqmnngnga nkkvelsrms stkssgsesd rserpvlvrq stfikeapsp tlrrkleesa
2461 sfeslspssr pasptrsqaq tpvlspslpd mslsthssvq aggwrklppn lsptieyndg
2521 rpakrhdiar shsespsrlp inrsgtwkre hskhssslpr vstwrrtgss ssilsasses
2581 sekaksedek hvnsisgtkq skenqvsakg twrkikenef sptnstsqtv ssgatngaes
2641 ktliyqmapa vsktedvwvr iedcpinnpr sgrsptgntp pvidsvseka npnikdskdn
2701 qakqnvgngs vpmrtvglen rlnsfiqvda pdqkgteikp gqnnpvpvse tnessivert
2761 pfsssssskh sspsgtvaar vtpfnynpsp rkssadstsa rpsqiptpvn nntkkrdskt
2821 dstessgtqs pkrhsgsylv tsv

FIG. 10

FIG. 11B